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THE VELOCITY OF CONDUCTION IN NERVE FIBER 
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The theory developed in this paper shows that the propagation of 
spike potential along a nerve fiber and the conduction of an electric 
wave along an inert inorganic conductor follow a common quantitative 
relationship. This result gives further support to the belief that propa- 
gation of excitation is an electrical process. The basic idea of the the- 
ory is derived from the consideration that velocity has, by its mathe- 
matical definition, a local meaning; conduction in a nerve is completely 
determined by the local characteristics of the latter, as well as those of 
the wave. The final formula derived does not make use of any other field 
of science beyond the fundamental principles of electricity. It gives 
the conduction velocity in terms of the electric characteristics of the 
fiber and of the duration of the spike potential. The formula is in agree- 
ment with the known dependence of the conduction velocity on various 
parameters characterizing the axon. The computed velocity agrees with 
the measured ones on the squid giant axon, crab nerve axon, frog mus- 
cle fiber and Nitella cell. The membrane inductance appears as a ve- 
locity controling agent which prevents also a possible distortion of the 
spike potential during conduction. The structural meaning of the elec- 
tric characteristics of the axon membrane is discussed from the view- 
point of the diffusion theory. A formula for the velocity of spread of 
the electrotonus is also derived. 


The object of the present paper is: (i) to derive a formula for 
the velocity of propagation of excitation in terms of the electric 
characteristics of the axon and of the duration of the spike poten- 
tial; (ii) to discuss a possible structural meaning of the known data 
on the electrical properties of the axon; (iii) to examine the veloc- 
ity of spread of the subthreshold potential from an electrical view- 
point. 


1. Axon as an Electric Circuit. 


We base our theory on the following well-known electric cir- 
cuit representing a nerve fiber (Fig. 1) (see, e.g., Cole and Baker, 
1941; Curtis and Cole, 1944; Hodgkin and Rushton, 1946; Davis and 
Lorente de Nd, 1947). The axoplasm and the external medium are 
considered as distributed longitudinal resistances of r;and rz; ohm/cm 
respectively, in parallel with respect to each other. These resistances 
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FIGURE 1. Electric circuit of an infinitesimal element da of a fiber. 


are connected transversally by a uniformly distributed membrane 
impedance consisting of a capacity of ¢ farad/em in parallel with a 
resistance of ry ohm cm and with an inductance of L henry cm, both 
per unit length of the fiber and both in series with each other. AI- 
though inductance is considered in the formulas, we do not exclude 


the case L = 0; some of our conclusions are practically independent 
of L. 

2. Wave propagation in an Electric Conductor and Conduction 
of Excitation in an Axon. 

The theory of electricity offers a simple mathematical descrip- 
tion of propagation of a wave caused by a harmonic electromotive 
force applied at one end of a semi-infinite conductor (see, e.g., Pen- 
der and Warren, 1943, pp. 201-09). In various theories axons have 
often been considered to be of infinite length (Rashevsky, 1948; 
Hodgkin and Rushton, 1946; Davis and Lorente de Nd, 1947 ) and 
this approximation seems to be quite good under appropriate experi- 
mental conditions. The type of conductor that is usually employed 
in electrical engineering is somewhat different from the circuit rep- 
resented in Figure 1. This is mainly because in transmission lines 
the inductance is in series with one of the longitudinal conductors 
and not in parallel with the capacity (see, e.g., Knowlton, 1949 pp: 
1184-86, 2047-49; Pender and Warren, loc. cit.). For this reason 
we have to derive first of all a formula for the propagation of a 
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simple harmonic perturbation in a conductor such as that repre- 
sented in Figure 1. We consider the steady state and will justify at 
the end of this section, as well as in the following sections, the pro- 
cedure of considering the conduction of excitation as a propagation 
of the simple harmonic perturbation in the steady state. We use the 
method of complex variable of the theory of alternating currents 
(see, e.g., Pender and Warren, 1943, chap. vi). If V is the complex 
voltage across the membrane, that is, between A and B (Fig. 1), we 
have, in the steady state, 


OV (OX ie V mawith ik? = (7,4 #;)/ 2), (1) 
where z is the complex impedance of the membrane, per unit length: 
Ly2 SS 8 (ry? ae «@?L?) =e jo le == IL, (1? al worl?) | (2) 


with 7 = V—1 and w = 2a/T, the period of the voltage considered 
being T sec. Equation (1) can be obtained from a formula which 
has been used in the case of direct current [Cole and Hodgkin, 1939, 
eq. (8) ] by substituting for the membrane resistance 7y its complex 
impedance z. Equation (2) is simply an expression of the fact that ¢ 
is in parallel with r, and L (see Fig. 1). We put V(+o) = 0, 
V(0) =V,. Then the solution of equation (1) is: 

V(x) =V.e(cosBu—jsinfx), wherek=at+j6, (8) 
a and f£ being two positive quantities and x the distance measured 
along the axis of the conductor. Since k? = a? — f? + 27 a B, from ex- 
pression (1) of k? and from relation (2), a system of two equations 
in a and # is obtained, which, putting 


Tu 2 ( @ L ) lis 
= | ( —* — } +( we -———_- 4 
. ( Ty == —) re Tue SF wl? ( ) 
gives 
Get Ti Tx ‘lie 
=| —-“"(s—-—_*—_}] . 5 
P 2 (s Ly? =F arlbe ( ) 


The quantity a is given by the same expression as $ except that a 
plus sign appears in the parenthesis, after S, instead of a minus 
sign. 

Since we consider a simple sinusoidal electromotive force at 
x# = 0, we can write for the instantaneous value of the voltage Vo: 


\V.| sin of . (6) 


Of course, the assumption of a harmonic electromotive force at 
x = 0 should not be interpreted as an analogue of a similar electro- 
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motive force actually acting in an axon. This electromotive force is 
introduced here only for the purpose of deriving a simple formula 
for the velocity of conduction of a perturbation in a conductor whose 
elementary electrical characteristics are the same as those of an axon 
(Fig. 1). The reasons why this formula is applicable to the velocity 
of propagation of excitation will be given later. 


a 
) 


a 


FicureE 2. Vector diagram of equation (3). OP |Vo| e-@ cos Ba, 


PQ = |V,| e-@ sin Bx. The instantaneous value of the voltage V is given ac- 
cording to equation (8) by the projection of OQ on aa. 


Relations (3) and (6) can be represented by the vector diagram 
of Figure 2 in which all the vectors rotate around O with an angular 
speed of w radians per second and the instantaneous value of the 


voltage V is given by the projection of OQ on the axis aa. This is 
a known representation of sinusoidal quantities by rotating vectors. 
Use is made here of the fact that multiplication of a vector by —j 
means its rotation by —90°. Consequently, using relations (3) and 
(6) we see the instantaneous value of the voltage V at a moment t 
and point « is (cf. Fig. 2): 


|Vo| e-* sin(wt — Bx). (7) 


The coefficient a represents the damping of the wave owing to dissi- 
pation of energy in the resistances. If we neglect the damping, ex- 


pression (7) has the form of a wave propagating with a constant 
speed equal to: 


v= w/b =2n/BT. (8) 


The main objective of the present paper is to show that formula 
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(8) expresses with good approximation the velocity of conduction 
of the spike potential in an axon, where T is the duration of the spike. 
One of the first questions which presents itself in this connection is 
the following: How can a formula for the velocity of propagation of 
a sinusoidal wave in an inert conductor be used to calculate the ve- 
locity of conduction of a spike potential in an active axon and why 
can the damping be neglected in this calculation? 

To answer this, let us examine first the physical conditions deter- 
mining the velocity of conduction of sinusoidal wave in a conductor. 
The function V which we have considered represents the potential 
difference between A and B (Fig. 1). However, except for a pro- 
portionality factor and possibly also for a different reference po- 
tential as zero potential, V is also the potential in the extrapolar re- 
gions of the external resistance [Davis and Lorente de No, 1947, for- 
mula (14); Hodgkin and Rushton, 1946, formula (1.4) ; Hodgkin, 
1947b, p. 324]. Therefore, except for terms which do not affect the 
wave character of expression (7), the latter represents not only the 
voltage across AB but also the potential in the extrapolar regions 
of the external medium. Figure 3 gives the diagram of this poten- 


C=CONST 
Diy ee D —_ Vv 
M, Mz 
eHrReare aires 
—__———> x 


Figure 3. Diagram of a sinusoidal voltage wave as an analogue of a spike 
potential, monophasic if the reference line is taken in M,M,, diphasic if it is 
taken in DD. The arrows below the curve give the directions of the local cur- 
rents. 


tial at a fixed moment ¢ in the case of an inactive conductor (rep- 
resented by Fig. 1) which has a negligible damping. This diagram 
represents a wave moving at a constant speed v. The arrows below 
the curve represent the directions of the local currents in the exter- 
nal medium at the moment ¢ considered. For simplicity, these ar- 
rows are drawn on the assumption that the phase difference between 
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current and voltage is negligible; the directions of the arrows ex- 
press then the elementary fact that current flows in the direction 
of decreasing potential. Let us consider one wave length, for in- 
stance, that lying in M,M, (Fig. 3). What are the quantities on 
which the rate of advancement of this wave length depends in a direct 
manner? Since velocity involves distance and time up to infinitesimals 
of first order only (v = dz/dt), the rate of advancement of the part 
of wave M,M, at the moment ¢ is determined completely by the 
conditions of the conductor from M, — dx to M. + dx during the 
time from ¢ to t + dt. These conditions are expressed by the values 
of the electric constants which that part of the conductor has during 
that time. Conditions of the parts of the conductor which between 
the moments t and ¢ + dt are distant from M,M. by more than in- 
finitesimals of first order in x do not affect the velocity of the part 
M,M, of the wave at the moment t. (For other applications of this 
type of argument to the velocity of conduction in nerve see Opatow- 
ski, 1950, 1951). One could think a priori that the velocity of pro- 
pagation of the wave might depend on the magnitude of the voltage. 
Formulas (8) and (5) show that this is not the case. In the physio- 
logical field there are indications that conduction velocity does not 
depend even on the type of stimulus. R. J. Pumphrey and J. Z. Young 
(1938) mention that a drop of sodium citrate solution produced an 
impulse in a squid giant axon which traveled at about the same speed 
as one induced by electrical stimulation. 

Now the conjecture may be advanced that the velocity of con- 
duction of a spike potential is expressible by the same formulas (8) 
and (5) if the electric characteristics of the active part of the axon 
are used in these formulas and T is interpreted as the duration of 
the spike. In fact the part M,M. or DD of the wave shown in Fig- 
ure 3 is quite a good, although crude, approximation of a spike po- 
tential, monophasic if the voltage reference line is taken as in M,M., 
diphasic if it is taken as in DD. The fact that in an axon a single 
piece of wave, for instance, M,M,, appears whereas in Figure 8, 
which represents an electrical phenomenon in a purely inert conduc- 
tor, that piece is preceded and followed by many other identical wave 
pieces is immaterial as far as the velocity of conduction of M,Mz is 
concerned. This velocity is independent of what happens beyond 
M.M, except for infinitesimals of first order, as already explained. 
Thus what happens within one wave length of a conductor through 
which alternating current flows is a good formal analogue of what 
happens in an active part of the axon. As a matter of fact the direc- 
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tions of the currents drawn in Figure 3 for an ordinary conductor 
well represent the local action currents in the external medium. of 
the axon, if M,M., for instance, is taken as the spike potential (cf. 
e.g., Lloyd, 1947, p. 11). Thus the local action currents do not appear 
to be a purely biological phenomenon, since they exist in their essen- 
tial characteristics also in inert conductors during wave propagation. 

As to the reasons for considering damping to be negligible, al- 
though dissipation of energy must occur in electric resistances, one 
justification only seems to be possible. The internal energies which 
are believed to be released during excitation (cf. e.g., Curtis and 
Cole, 1944) compensate for the ohmic loss of energy in the ‘axon. 

Reasoning which has lead us so far to conclude that formulas 
(8) and (5) should express the velocity of conduction of'a spike po- 
tential is to a great extent of an intuitive character. It is a compari- 
son of these formulas with experimental facts that can determine 
the degree of confidence to be assigned to the former. This is dis- 
cussed in the following section. 

3. Experimental Test of the Formula for the Velocity of Con- 
duction in an Axon. 

Indicate with ¢ the diameter of the membrane and putting 
T, — abry, the resistance of the membrane in ohm cm’, L, = a¢L 
its inductance in henry cm? and c, = c/a¢ its capacity in farad/cm’, 
all three per unit area of the membrane. The velocity of: eds 
can be written as follows: 


5 : t 
ee 9 
is SA SORE 7 - 


where w = 22/T with T the duration of the spike potential, 7; and 7, 
the external and internal resistances per unit length, and — 


ee 2 wlig )° 2 
= —— + QS SSS 
os ( Tee ie (abe ) ( Gos Ory 


Ta 
(PaaS iy Oe 


(10) 


Until the present time there have been essentially three formu- 
las for the velocity of conduction in an axon; that of N. Rashevsky | 
(1933, 1948), of W. A. H. Rushton (1937) and of F. a A. Wein- 
berg and G. Young (1940). 

Experimental data covering mainly non-myelinated fibers indi- 
cate that 7, and c, do not vary greatly for axons of a given kind un- 
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der constant environment and excitability conditions (Cole and Cur- 
tis 1939, 1950; Hodgkin, 1947a; Hodgkin and Rushton, 1946; Cole 
and Baker, 1941; Cole and Hodgkin, 1939). This is particularly 
true for membrane capacity which seems to be contained within 
a narrow range, even for a broad variety of cells (Cole, 1947, 
p. 23). The spike duration T seems to be constant also for axons of 
a given kind under constant conditions. The mammalian nerve fibers 
offer a convincing example of this (Grundfest and Gasser, 1938; 
Gasser and Grundfest, 1939; Grundfest, 1940; Gasser, 1941; Lloyd, 
1947). Thus U seems to be independent of the size of the fiber and 
for non-myelinated fibers ¢ can be taken as the diameter of the axo- 
plasm and of the external medium at the same time, since the mem- 
brane is very thin (see sec. 4 and Cole, 1947, p. 22). Then 7; is in- 
versely proportional to ¢?. If rz is also taken approximately as in- 
versely proportional to ¢?, velocity appears to be from equation (9) 


directly proportional to \/¢. Formulas of Rashevsky and of Offner, 
Weinberg and Young lead to exactly the same conclusion which has 
been confirmed by experiments on the squid giant axon. These ex- 
periments gave a proportionality of v to ¢°°"* (Pumphrey and Young, 
1938) with about the same relationship also holding for other 
Cephalopods. 

In myelinated fibers of cat and rabbit nerve the velocity is a 
linear function of the diameter according to J. B. Hursh (19389) or 
slightly less than that according to H. S. Gasser and H. Grundfest 
(1939). However, the previous reasoning which led us to conclude 
that the velocity of conduction is proportional to the square root of 
the diameter does not apply here. The conductor AB, through which 
the current flows transversally (cf. Fig. 1) has been identified in 
non-myelinated fibers with the cell membrane, but in medullated 
fibers it includes also the myelin sheath, at least insofar as the re- 
sistance of this transversal part of the conductor is concerned kets 
Cole and Hodgkin, 1939; Rosenberg and Schnauder, 1923). But mye- 
lin may be as much as 25% of the diameter of the axon, and even 
more (see, e.g., Kiss and Mihalik, 1928) ; consequently ¢ cannot be 
taken at the same time as the diameter of the membrane, of the axo- 
plasm and of the external medium. If these three diameters were 
proportional to each other, v would be, theoretically, still propor- 


tional to \/$; however, experimental data for the fibers of the saphe- 
nous nerve of cat and rabbit show that the proportionality between 
the external and internal diameters holds only for a partial range of 
these fibers (Gasser and Grundfest, 1939). This non-linearity was 
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noted also by F. O. Schmitt and R. S. Bear (1937) on fibers of frog 
sciatic nerve, by Arnell (1936) on spinal nerve and was recently 
confirmed through extensive studies of G. W. Taylor (1940, 1941, 
1942, 1943), F. K. Sanders (1948), Sanders and D. Whitteridge 
(1946) on nerve fibers of various animals. As Schmitt and Bear 
state (1939), “... comparison of fibre velocity with diameter alone, 
without regard to degree of myelination, is apt to be confusing.” 
Aside from this it is doubtful whether the structure consisting of 
membrane and myelin sheath can be represented by the circuit AB of 
Figure 1 (see sec. 4). Assigning a special function to the Ranvier 
nodes in the conduction process, Offner, Weinberg and Young (1940) 
obtained a different argument for a deviation of the conduction ve- 


locity from the proportionality relation v = ¢. However, view- 
points on the role that Ranvier nodes may play in the conduction are 
conflicting (Curtis and Cole, 1940, p. 593; Rosenblueth, Wiener, Pitts 
and Garcia Ramos, 1948, p. 310; Grundfest, 1947, p. 488), and the 
assumption of Offner, Weinberg and Young that conduction velocity 
depends on internodal distance is not confirmed by the experiments 
of Sanders and Whitteridge (1946). 

Together with the formulas of Rashevsky, Rushton and Offner, 
Weinberg and Young the present formula has in common the in- 
verse square root proportionality of the velocity to the sum of the 
external and internal resistances. That the velocity of conduction 
should decrease when the external resistance increases was shown 
by A. L. Hodgkin (1939) on crab and squid giant axon using sea 
water, air and oil as external media. A quantitative indication of 
the correctness of this relationship was obtained by B. Katz (1947) 
using different electrolytic concentrations of the external medium of 
a crab nerve fiber. An indirect confirmation of the same relation 
was recently also given by Hodgkin (1947a), and in allied fields of 
muscle and Nitella cell the qualitative character of the relation has 
been known for some time (Pond, 1921; Auger, 1933). 

The discussion just presented implies the assumption that the 
spike duration T is independent of the external medium. Although 
no systematic data seem to be availabie on this subject, a few rec- 
ords published by Hodgkin (1939) indicate such independence. 

The relation (9)—(10) differs from the Offner, Weinberg and 
Young formula in the way that v is made dependent on the shape 
of the spike potential. The present formula makes v dependent on 
the duration of the spike, whereas Offner, Weinberg and Young make 
vy dependent on its magnitude at the inflection point of the rising 


phase. This inflection point is quite often unobtainable with suf- 
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ficient accuracy, since the spike potential sometimes is practically 
a straight line around that point. As a matter of fact H. S. Gasser 
and H. Grundfest (1939) calculated the spike potential of a nerve, 
in good agreement wits its actual shape, by assuming a straight tri- 
angular spike for the fibers (cf. also Lloyd, 1947, p. 102). On the 
contrary, the spike duration is obtainable with quite good accuracy. 
In addition to this, and still more important, is the fact that the con- 
duction velocity does not seem to be very much affected by varia- 
tions of spike duration (see numerical results of this section con- 
cerning crab axon and an approximate formula for the conduction 
velocity in Nitella cell). 

We proceed, now to a numerical computation of the velocity of 
conduction. Let us consider first a squid giant axon. We can take 
as an average value of the spike duration 7 = 2 msec (cf. Pumphrey 
and Young, 1938; Hodgkin, 1939; Katz, 1947). K. 8. Cole and H. J. 
Curtis (1940) give for a squid giant axon of 446u diameter in sea 
water rz + 7; = 37,000 ohm/cm. Let us consider a diameter of 292u 
for which values of spike velocity are available (Pumphrey and 
Young, 1938). We take: rz + 7; = 37,000 X (446/292)? = 86,400 
ohm/cm, L, = 0.2 henry em? for the membrane inductance, c = lu 
farad/cm? for its capacity, both per unit area (Cole and Baker, 1941; 
Curtis and Cole, 1944; Cole and Curtis, 1939). To see the effect of 
inductance and of capacity, calculations are made also with L, = 0 


and ¢ = 2u farad/cm?. Table I gives the results for various values 
of the membrane resistance. 


TABLE I 
CALCULATED CONDUCTION VELOCITIES 
OF A SQuID GIANT Axon 


Tg Oh Crh? Basis as 25 100 200 400 700 
v m/sec with 
¢,=14f/cm2 2 
paren } 12.9 18.5 13.9 13.3 11.9 
= a t/em2 | 
a 
1 ah, On ar 44.9 22.8 16.6 12.8 111 
¢,—=2uf/em2 
orate ae 7.3 7.5 7.5 7.5 7.2 


L, = 0.2 h.em2 


The experimental value of v given by R. J. Pumphrey and J. Z. 
Young (1938) for an axon of 292u is 14.9 m/sec. However, if the 
smooth curve given by the same authors is used, a magnitude of 
about 13.5 to 18.6 m/sec is obtained. This is in very good agreement 
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with the theoretical values of Table I, since the range of 25 to 400 
ohm cm? is very likely to contain the correct average membrane re- 
sistance of the active part of the axon (Cole and Curtis, 1939, 1940; 
Cole and Hodgkin, 1939; Cole and Baker, 1941). 


The results shown in Table I are quite significant. First of all 
we note that a change of the membrane resistance within its range 
as known through experimental data (Cole et al) has practically no 
effect on conduction velocity. (See the second line of Table I. Aver- 
age membrane resistances larger than 400 ohm cm? are not likely to 
correspond to axons under action potential.) This may explain why 
fluctuations in the measured values of 7, (Cole et al; Hodgkin and 
Rushton, 1946) seem to be higher than those of the conduction ve- 
locity. This result is in contrast with that of Offner, Weinberg and 
Young, which gives v as inversely proportional to \/7,. However, 
7, has in the present work a somewhat different meaning than in 
the Offner, Weinberg and Young formula. In the present formulas 
it is reasonable to consider r, as an average resistance of the active 
part of the membrane; in Offner, Weinberg and Young’s theory 7, 
is the local membrane resistance at the moment at which the poten- 
tial goes through its inflection point in the rising phase of the spike. 

Although 1u farad/cm? is a good representative average of mem- 
brane capacity, occasionally higher values have also been found. It 
is seen from Table I that increasing the capacity by a factor of 2 
reduces the velocity in the same proportion. This is in agreement 
with Offner, Weinberg and Young’s formula according to which con- 
duction velocity is inversely proportional to capacity. Qualitatively 
this relationship is intuitive since larger capacity implies a slower 
charging and discharging process which is believed to accompany the 
spike conduction. 

Another interesting conclusion that can be drawn from Table I 
is that concerning the inductance which is of small effect for higher 
membrane resistances but reduces the conduction velocity in a very 
pronounced manner at lower membrane resistances. In this way the 
conduction velocity is maintained practically independent of varia- 
tions of the membrane resistance. (Compare lines two and three of 
Table I). That such a velocity-regulating agent should exist in the 
membrane is suggested also by the following consideration independ- 
ent of the concept of nerve inductance: All formulas for the con- 
duction velocity which do not take into account the inductance, in- 
cluding the present one if L, = 0, agree on the fact that this veloc- 
ity depends on membrane resistance. Now it has been shown by Cole 
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and Curtis (1939) that membrane resistance changes during activity, 
lower resistances corresponding to higher degrees of activity (see 
also Hodgkin and Rushton, 1946, p. 477). Consequently, to each or- 
dinate of the spike potential corresponds a different membrane re- 
sistance (see, in particular, Fig. 10 in Cole and Curtis, 1939). There- 
fore, in the absence of a speed-regulating agent, each ordinate of the 
spike potential would move at a different speed, which would result 
in a distortion of the spike potential during conduction. As Table 
I shows the inductance slows down high conduction velocities and 
leaves low velocities practically unaffected. This may be explained 
by the fact that inductance stores the energy when its availability 
increases and releases it later in the decreasing phase. Through this 
speed controling function the inductance prevents distortion of the 
spike potential which would occur during conduction because of a de- 
pendence of the conduction velocity on membrane resistance and de- 
pendence of the latter on the local magnitude of the potential. Thus 
inductance would accomplish in a nerve the same function as it does 
in a telephone cable, although through a different electrical mecha- 
nism (Pupin coils, see, e.g., Knowlton, 1949). 

The velocity controling effect of the inductance appears clear 
from the following expansion of 1/.\/U to which v is proportional: 


1 1 1 i 
VU Ben oe, 2wC, wla 


‘i ( oul ) 1 
Ab A. + fi s — CHCaCE ae 
ZC, N & Gg (wL,)? | 


This expansion [see eq. (10)] is valid for large inductances. An 
infinite inductance would make the velocity exactly independent of 
the membrane resistance (and inversely proportional to the square 
root of the capacity). A finite, but large, inductance would not change 
this independence of velocity and membrane resistance by very much, 
since the latter does not appear in the above expansion below the 
term of the second order. Even there its effect is not very large, 
since for the squid giant axon just considered 3/4ac, is about 250, and 
the average membrane resistance during activity is certainly not 
more than that. As a matter of fact, an infinite inductance would 
give velocity only about one-third lower than the actual one found 
with 0.2 henry cm?. Thus a possible meaning has been found for the 
very high inductance which has been noted in the squid giant axon 
membrane (Cole et al): the velocity regulating function of the in- 
ductance requires a large amount of the latter. 
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The effectiveness of the inductance to keep the conduction ve- 
locity independent of variations of membrane resistance appears 
clear if one considers L, = 0. Then: 


O Sasi a 4 Cp?) 7? Sr, . 


Since in the present case we, is about 3 X 10° farad/cm? sec and 7, 
does not exceed 100 ohm cm? by very much, we obtain by power se- 
ries expansion: 


Oe (2/12) ees (wez) ’ 


which is infinite for 7, = 0 and equal to about 47 for r, = 100. A 
large inductance levels off this variation to a constant value of 
he ac, = 183 

Of course there is no reason to believe that this may be the only 
effect of the inductance. It has been noted, for instance, (Eccles, 
1948, p. 102) that axons have a decreasing tendency toward oscilla- 
tory responses in the face of increasing membrane resistances; this 
may also be a combined effect of inductance and membrane resistance. 

The question of how the relatively high inductance of 0.2 henry 
cm? can be located in such a small structure as an axon membrane 
is difficult to answer in a definite manner. Cole and Baker. obtained 
evidence for the existence of an inductance by applying the theory of 
linear electric circuits. Since ions are the carriers of the current in 
the nerve, the existence of a high inductance means, within the frame- 
work of that theory, the existence of ionic paths which give a high 
magnetic flux through the membrane. Cole and Baker found that 
fresh axons never had inductance and that this arose, in a great ma- 
jority of axons, locally at least, within one hour or so after excision. 
One could think, therefore, that the conductivity changes, which are 
known to occur after excision and can take place only through the 
intermediary of ions, simultaneously hring about the establishment 
of ionic paths across the membrane which confer to the latter the 
property of a relatively high inductance. As Cole suggested (1947) 
changes in conductivity and appearance of the inductance may be 
closely associated with each other. However, computations of the in- 
ductance by standard methods of the theory of electricity (sec. 4) 
do not indicate the existence of such ionic paths. Consequently, more 
plausible seems to be the conjecture that the circuit is actually non- 
linear and behaves, within a certain range, as if it were linear and 
had an inductive membrane (Cole, 1941, 1947). As a possible origin 
of such non-linearity appropriate electrochemical reactions (Bart- 
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lett, 1948) and piezoelectric effects (Cole, 1941) have been sug- 
gested. 


TABLE II 
CALCULATED CONDUCTION 
VELOCITIES OF A CRAB AXON 


7, ohm cm? 25 100 1000 7653 


v m/sec with 


Tie" -2Msec 8.38 4.35 2.46 2.28 
0 

T =1.2 msec 2.47 2.49 2.A7 2aeAy) 
i 0:2 niem2 

T =2 msec 8.43 4,25 2.01 1.78 
a0 


Table II gives the results of calculations on a crab nerve axon 
of 31.8 uw diameter for which data of Hodgkin (1947a) are available; 
for an axon in oil 7;/rg = 1.58, resistivity of axoplasm = 90 ohm 
cm, 7,4 = 7653 ohm cm? (axon in resting conditions), c, = 1.112 u 
farad/cm?. This gives rz + 7; = 1.85 X 107 ohm/cm. From another 
paper by Hodgkin (1939) we take T = 1.2 msec for an axon under 
similar experimental conditions. It is seen from Tables I and II that 
the relative effect of an inductance of 0.2 henry cm? would be about 
the same here as in the squid giant axon. Although the only meas- 
ured inductances that are available are those of the squid giant axon, 
Hodgkin and Rushton (1946) noted in some lobster axons a possible 
inductance of 0.3 henry cm?. 


The experimental value for conduction velocity in oil of a crab 
axon of about 30u diameter, as determined by Hodgkin, is 3.5 m/sec. 
This suggests, according to Table II, that probably no inductance 
existed in the membrane and that its average membrane resistance 
was somewhere between one hundred and a few hundreds of ohm cm:?. 
Table II also contains results with a spike duration of 2 msec, which 
has been observed on crab axon by Hodgkin and Rushton (1946, p. 
477). It is seen that a change of T from 1.2 to 2 msec has practically 
no effect on v , particularly in the pertinent range of 7,. 

For muscle fiber the data are less complete. Using the values 
given by B. Katz (1948) for the Extensor Longus Dig. IV of Rana 
temporaria, 6 = 45u, ¢, = 4.5u farad/cm?, 1;/rz = 1.08, resistivity 
of fiber interior 260 ohm cm, and taking tentatively T = 5 msec, we 

obtain v = 0.4 m/sec for 7, = 4,000 ohm em? and v = 0.7 m/sec for 
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74 = 100 ohm cm?. The figure of 4,000 ohm cm? is indicated by Katz 
for the resistance of a resting membrane. During activity a much 
lower resistance may be expected. It is seen that the calculated ve- 
locity agrees with the measured one within a factor of about 2 (cf. 
Katz19489 po527) . 

We close this section with a calculation of the conduction ve- 
locity for Nitella cell. Since the structure of this cell is, from an 
electrical viewpoint, more complicated than that of an axon (cf. Os- 
terhout, 1943), the diagram of Figure 1 cannot give more than crude- 
ly approximated results. The diameter of this cell may be as large 
as 500u (Blinks, 1930; Curtis and Cole, 1937) and we base our cal- 
culations on a figure of ¢ = 400u. The duration of the action poten- 
tial may be as large as about 15 sec (see, e.g., Blinks, 1930; Oster- 
hout, 1935, 1936; Hill and Osterhout, 1935; Osterhout and Hill, 1940) 
and we take tentatively T = 10 sec. We will see later that the order 
of magnitude of the quantities involved is such that the exact value 
of T is immaterial. A capacity of lu farad/cm? seems to be as jus- 
tified here as in the case of an axon (Curtis and Cole, 1937; Cole and 
Curtis, 1938). We may indicate for a typical resting membrane re- 
sistance 250,000 ohm cm? (Blinks, 1930; Curtis and Cole, 1937). 
However, at the height of activity the membrane resistance may de- 
erease by a factor of 200 (Cole and Curtis, 1938); consequently, 
we take as an average value for the whole active part of the cell 
1%, — 10° ohm cm?. For the sum of the external and internal resist- 
ances we assume the round figure of rz + 7; = 10° ohm/cm on the 
basis of data of Blinks (1930). Then 7,?m’c,” is so small with respect 
to unity that we can write, since we neglect L., 


Gaal or 767 te — 1) ra wre fa. 


Consequently v ~ 2/[¢aV/a¢(1%2 + 11)7a] , which is independent of 
w. This approximate formula shows the same type of dependence 
of v on c, and r, as Offner, Weinberg and Young’s formula. However, 
according to these authors, their formula is not in agreement with 
the experimentally observed conduction velocity of Nitella. Substi- 
tuting the previously mentioned numerical constants into the last 
equation, we obtain v ~ 5.6 cm/sec, which falls well within the ex- 
perimental range of 2 to 20 cm/sec (Osterhout, 1936). 


4. The Electric Characteristics of Axon from a Structural 


Viewpoint. ; 
Since electric phenomena in the nerve are of an electrolytic na- 
ture, the electric properties of an axon are a consequence of its struc- 


292 VELOCITY OF CONDUCTION IN NERVE FIBER 


ture and of its ionic composition. This fact has been previously used 
to gain some tentative insight into the membrane (Cole, 1947, 1949). 
The capacity of a thin cylindrical shell of thickness A: is: 

C= K/4anadce, (11) 
where K is the dielectric constant. The theory of diffusion of ions 
in dilute solutions, together with the law of perfect gases, gives us 
the following expression for the conductivity of an electrolytic me- 
dium (Planck, 1890): 

Bo é?D ni/kT, 
where 7; is the number of ions per cm? of kind 7, e; its charge, D; its 
diffusion coefficient which does not need to be assumed independent 
of concentration, k the Boltzmann constant and 7 the absolute tem- 
perature. From the above we obtain, for membrane resistance per 
unit area, 


Tea— kTA/D e?Dini, (12) 


where A is the thickness of the membrane. If, together with this 
equation, formula (11)is used for membrane capacity, A equals A, 
on the basis of the electric circuit of Figure 1. However, there are 
theoretical and experimental reasons to suspect that the circuit in 
Figure 1 may be, at least in some cases, insufficient to represent the 
actual situation. Davis and Lorente de Nd (1947), Rushton (1937), 
Rosenberg (1937), Bogue and Rosenberg (1934) and others consid- 
ered the diagram of Figure 4, or some of its particular cases, as 
possible electric equivalents of the membrane. According to the cir- 


A 


B 


FicurRE 4. Electric circuit illustrating a possible transversal structure in an 
axon: a heterogeneous membrane or a myelin sheath with an adjacent membrane. 
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cuit of Figure 4 A, is different from 4. This differentiation between 
the two thicknesses may be viewed in the following manner from the 
standpoint of the theory of diffusion. It is known that one of the 
prominent characteristics of a living cell membrane is its heterogene- 
ous structure (see, e.g., Davson and Danielli, 1943) through which 
the membrane acts as a sieve for the diffusing substances. It is con- 
ceivable that internally in the membrane, or adjacent to one or both 
of its surfaces, there is a layer which is impenetrable to some of the 
ions because of their large size or small energy. These ions would 
gather on the boundaries of that layer conferring to the latter prop- 
erties of a capacity; smaller ions or those having a sufficient energy 
would penetrate the layer, giving rise to a conductance in parallel 
with the capacity. 


Another reason to separately consider a capacity thickness and 
a conductivity thickness in the transversal structure of the axon 
arises in myelinated fibers. Electrically, because of their low con- 
ductivities, myelin and membrane both behave as transversal struc- 
tures with respect to the axoplasm, and A in formula (12) is the to- 
tal thickness of this structure. In some of these fibers a high capac- 
ity has been observed (Curtis and Cole, 1950, p. 593; Cole and Cur- 
tis, 1950, p 88). However, the myelin is so thick that the capacity 
calculated from formula (11) by putting A = A, in it would be small- 
er than the measured one by more than an order of magnitude. Even 
for a fiber like the squid giant axon, which does not seem to have a 
myelin sheath in the ordinary sense, but does contain, according to 
Bear, Schmitt and Young (1937), a similar very thin lipoid struc- 
ture, calculations would give a capacity that would be much too low 
if A. in formula (11) were taken as the thickness of this lipoid 
sheath. In fact, according to these authors, the thickness of this 
structure is less than 1% of the diameter. Taking, for instance, 
A, = 2.5u, which would be 1/2% for an axon of 500u, and assuming 
K = 8 (cf. Cole, 1947) we would obtain 


C, = B/4a X 2.5 X 10-* X 9 X 10° wf/em? ~ 10° wf/em?, 


which is 1,000 times smaller than the experimental value. This gives 
support to the diagram of Figure 4 in which one of the two resist- 
ances either at A or at B may be zero. Following Cole (1947), formula 
(11) can be used to determine the thickness of the capacity layer of 
the membrane. With c, = 1uf/cm? the value for 4. is 27 angstrom. 
On the other hand, if the very low capacity of ¢ = 4.1 a. 8 
farad/cm obtained for the peroneal nerve fiber of the bullfrog of 
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10u diameter is considered (Davis and Lorente de No, 1947) a much 
thicker capacity layer would be required. From the formula for the 
capacity of a cylindrical shell we obtain: 


At (eke Le 1) ¢/2 e 


With K = 3 this gives A. ~ 2,000 angstrom. 
Since, necessarily, dc < A, relations (11) and (12) give: 


S e2Din; > KkT/4nria (13) 


For K = 3, T = 300°, c. = 1u farad/cm?, r. = 1,000 ohm cm’, the 
right-hand side of this inequality is 10-"t. Therefore if we call D the 
average diffusion coefficient and n the total ionic concentration in 
the membrane, and assume as monovalent the great majority of ions, 
we obtain Dn > 4.3 X 107 (cm sec)-?. If an axon is electrically ex- 
pressible by the circuit shown in Figure 1, the sign of equality is to 
be taken in this relation. It is difficult to estimate either one of the 
two factors on the left-hand side of the above inequality. Cole sug- 
gested (1949), on theoretical grounds, m = 5 X 10% ions/cm’. This 
would give approximately D > 10-1 em?/sec. 

The number of ions which behave statically at the capacity layer 
can be estimated in the following manner. If EF is the voltage across 
the capacity and Q the charge per unit area, we have Q = ¢,H# . Since 
Ca = 1[uf/cm? we obtain from this, for the number of monovalent ions 
at each boundary of the capacity layer, 


L038 X10°X 104 B/48 10 S16 .25 XeL0S i ones 


where i is now in millivolt. If these ions were arranged regularly 
ina single layer to form a network with a square-shaped basic unit, 
the distance a of the two closest neighbors would be: 


a= 1265/VE angstrom , 


which is a very reasonable relationship since for E = 100mV the 
distance would be a = 126.5 angstrom. 

The product 7.c, which appears in relation (18) can give us an 
idea about the rate at which ions flow through the resistance par- 
allel to the capacity, or, phrasing it differently, about the proportion 
of ions which succeed in penetrating through the capacity layer with 
respect to those which are too big or too slow to do so. In fact (cf. 
Cole, 1947) 

Q=c.h,V—=r 1, 


where V is the voltage across the membrane, F that across its ca- 
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pacity layer, so that E' < V, and I is the intensity of the current flow- 
ing across a unit area of the membrane. From the above two equa- 
tions we obtain: 


Qjl ==Cir /V S Cire. 


Here again the sign of equality holds if the axon corresponds to the 
circuit shown in Figure 1. Under this hypothesis for a squid giant 
axon in resting conditions c.7, ~ 10° sec, that is, per each ion ac- 
cumulated at the capacity layer there are 1,000 ions per second pass- 
ing through it. During activity this figure may increase by a factor 
of 10 or more. Thus the capacity layer seems to be quite easily pen- 
etrable by the ions and the fraction of slow or heavy ions seems to 
be relatively small. For a Nitella cell ¢, is about the same as for 
the squid giant axon, but 7, is much higher. With r, = 10° ohm cm? 
for a resting Nitella cell, ten ions only would flow per second for 
each one accumulated at the capacity. The discussion presented in 
this paragraph is based on an idea already considered by Cole (1947) 
although a misprint makes the conclusion in that paper (p. 51) ap- 
pear erroneous. 


We now proceed to the discussion of the inductance. An induct- 
ance of 0.2 henry cm? for an axon is very high. In fact, the longi- 
tudinal inductance of a circular cylinder of length / and diameter ¢$ 
is according to Neumann’s formula (see, e.g., Rosa and Grover, 1912, 
p. 150): 

21 [In (41/¢) — 0.75] X 10-° henry . 


For an axon of 400u diameter and 10 cm length this would give about 
1.2 X 107 henry, whereas 0.2 henry cm? gives a radial inductance 
two million times larger. This comparison suggests that if the 0.2 
henry per cm? of the membrane are due to a linear conductor the 
latter would have to give an unusually high magnetic flux. Helically 
shaped conductors or coils are the most efficient from the inductive 
viewpoint in the sense that they give high inductance within a small 
space. Consequently, one could think of only the following way, if 
any, to materialize through a linear conductor an inductance of 0.2 
henry cm? within the thin transversal structure of the axon. As it 
has been pointed out in section 3, changes in conductivity which oc- 
cur, for instance, after excision of an axon could suggest that new 
ionic paths are being established in the sheath of the axon. However, 
even if we assume that these paths are helically shaped, we find that 
they cannot give the required inductance. In fact, consider the fol- 
lowing expression for the inductance of a coil of length 4, having a 
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circular cross-sectional area A and N windings per unit length (Rosa 
and Grover, 1912, p. 116): 


4nN?AA X 10° henry . 
The inductance per unit area is 
L=4nN?A2A X 10-° henry cm?. (14) 


Consider a squid giant axon 400u diameter and let us make assump- 
tions favoring a high L: (i) N = 2 X 10’ cm, that is, a distance of 
5 angstrom between two consecutive windings; (i1) d= 4 X 10% cm, 
that is, thickness of the transversal structure of the axon equal to 
1% of its diameter, which is possible according to Bear, Schmitt and 
Young (1937). Then from formula (14) we find that to give 0.2 
henry cm? the cross-sectional area of the ionic path would have to be 
A = 0.01 cm? which is geometrically impossible for an axon of 0.04 
em diameter. In addition to this, even if it were possible it would not 
correspond to the circuit shown in Figure 1 which assumes a dis- 
tributed current throughout the whole transversal structure and not 
a guided one in a single circuit of large dimensions. 

Thus there is no other choice but to ascribe the apparent induct- 
ance to a non-linearity of the circuit. 


5. Kinetics of Electrotonic Phenomena. 


Occasionally the opinion has been expressed that electrotonic 
phenomena spread instantaneously (cf. Bogue and Rosenberg, 1934). 
The object of the present section is to examine the speed and the 
length of spread of the electrotonus, that is, of a subthreshold vol- 
tage, in an axon on the basis of its electric characteristics. This 
analysis is based on the theory recently developed by Hodgkin and 
Rushton (1946), and by Davis and Lorente de Nd (1947). 

If I is the intensity of the current from A to B (Fig. 1), that is, 
across the membrane, and I, the intensity of the polarizing current, 
both per unit length, we have for the voltage V across the membrane 
due to the current I, (Davis and Lorente de No, 1947, p. 448): 


O2V J Oki (‘2 =F r,)1 ae Trl ‘ 


If G is the capacity and rv, the resistance of the membrane, both per 
unit length, we have 


I=J + V/at, 
V(x,t) =ryJ (a ,t) + Lad /ot, 
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where t is the time and J is the intensity of the current through the 
conductive branch of the circuit from A to B, per unit length (Fig. 
1). Putting J = 0 for t = 0, we obtain from the last equation: 


1 t 
i eet =; exp (—He) V («,7) exp( +) dr 


which, substituted in the previous two relations gives an integrodif- 
ferential equation for V(x ,t). We limit ourselves to the case L = 0 
and consider a time independent polarizing current. Put: 


eae, 2 [teh 
V=olyrs Mape ures: 

aie 

{iy Oo ee ag Cr x 7” Tu 


so that y is a dimensionless measure of the voltage across the mem- 
brane, 7 and € dimensionless measures of time and length. Then we 
have (Davis and Lorente de Nd, 1947, p. 452): 


Pewee file wear) of (Sy 7) 5 (15) 


where & < 0 is the dimensionless abscissa of the anode. Since the 
cathode is at 6 = 0, then 


ly| i ly| 7 
BP, 7) =e erfe( vs )- ee eliO Np ty ole, 
Vit Vr 


erfe (u) aes [exp (—u?) du. 
Va My 


Let us consider from now on the cathodal extrapolar region 
— > 0. The spread ¢ = &* of a voltage » is obtained as the root of 
the equation 


2y(&, 0) =exp [—2(&— &,)] — exp (—26). (16) 
The velocity with which this voltage spreads is: 
aiden.’ 2 dé 


Ue as Ee , 
dt ¢\/fu(ve + 11) dr 


where for each voltage y, the derivative of d&/dr is obtained by an 
implicit differentiation of equation (15). We limit ourselves in these 
calculations to an infinite polar distance; then we have &, =— o, and 
y(€, 7) =—f(é,7). From here: 


298 

dr of /0& cies 

Ses =2\/nr| e erfc 
dé of /dr ( 


VELOCITY OF CONDUCTION IN NERVE FIBER 


g 5° 
—+ 7 )-2 [em (o+= 7 
Te a 


where & and 7 are related to each other by the equation vy = »(é, 7). 


At the cathode €=0, and: 


dé es 
dr 2Q/ar 
Where ert (7) e-—le——ertemqnn 


=-40rt, (V7) ’ 


The elimination of + between these 


two equations gives the dimensionless velocity d&/dz of the dimen- 


sionless voltage v. The result o 


f this elimination is represented in 


Figure 5. It is seen that low voltages spread with high speed. The 
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Figure 5. Velocity dé/dr of spread of electrotonic voltage v at the cathode. 
Both velocity and voltage in dimensionless units. Infinite polar distance. 


extent &* of the spread of the volta 


by putting €,=— w: 
f= 


ge is obtained from equation (16) 


$ In(—2y). (17) 
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Since » < 0, voltages which are small numerically have a large 
spread. €° = 0 for y = — 1/2 which is the maximum possible numeri- 
cal value of » (ef. Davis and Lorente de No, 1947, Fig. 3). Obvi- 
ously d&/d7 = 0 até = &. 

As an illustration, let us consider a series of data on squid giant 
axon (Cole and Hodgkin, 1939): ¢ = 540u, 77 = 61,800 ohm/cm, 
7; = 16,200 ohm/cm, ry = 4,110 ohm cm (which corresponds to about 
700 ohm cm’), ¢ = 0.17u farad/cm. Therefore: 


2/¢V ru’ + 71) = 6.57 m/sec. 


Table III shows some numerical results. For numerically larger 
y’s, Figure 5 has been used to get dé/dr; for numerically small »’s 
the asymptotic relations 


a = d& exp (—2”) 1 
yr -, Tear, — z= —( z 
I dr 2 x |r| 2x |r| 


have been employed. 


TABLE III 
VELOCITY V OF ELECTRONIC 
VOLTAGE V AT THE CATHODE 


I,uA —VmV v m/sec 
1 0.5 29.8 
1 5 1155) 
5 0.5 148 


Thus it is seen that only very low voltages spread with a high speed. 
The length of spread of these voltages is respectively 0.6, 0.08 and 
1 cm, the largest spread corresponding to the highest velocity v. 


SUMMARY 


The paper deals with the kinetics of propagation of excitation. 
It is not concerned with the mechanism through which the action po- 
tential arises. The basic idea of the paper is that the velocity of 
propagation of a perturbation is determined by the electric character- 
istics of the conductor and by some characteristics of the perturba- 
tion. It appears that as far as the spike potential is concerned, its 
duration, together with the electric properties of the axon, are suf- 
ficient to determine the velocity of conduction. The formula derived 
is in accord with the known facts on the dependence of this velocity 
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on various parameters. Computed conduction velocities for the squid 
giant axon, crab nerve axon and Nitella cell agree with experimental 
measurements. The theory indicates as a possible role of inductance 
that of slowing down high conduction velocities which would arise in 
the case of a decrease of membrane resistance. The membrane ca- 
pacity and conductance are examined from the viewpoint of the the- 
ory of diffusion. A quantitative discussion of the inductance con- 
firms the opinion expressed by other authors that inductance is prob- 
ably an effect of some non-linear electric characteristic of the axon. 
The velocity of spread of the electrotonic voltage is also analyzed. 

The author acknowledges efficient help given by Messrs. A. Bier- 
man, L. D. Cohn and L. T. Zimmer in numerical computations lead- 
ing to Figure 5. 
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The propagation of a transverse disturbance along a tubular mem- 
brane enclosing a fluid medium and embedded in another is considered. 
It is shown that the velocity of propagation of such a disturbance can 
be identified with the velocity of the conduction process of thin-sheathed 
nerve fibers. The required values of the associated parameters, tension 
and pressure, appear not unreasonable. The results obtained indicate 
that experimental observations on the relation between the conduction 
velocity and the fiber diameter, as well as the effects of longitudinal 
stretching and transverse squeezing on the velocity of the conduction 
process in nerve, may be correlated on such a basis. 


We consider the possibility of mechanical motion as a factor in 
the determination of the velocity of the conduction process in thin- 
sheathed nerve fibers. As indicated in the literature (Hill and 
Keynes, 1949; Tobias and Solomon, 1950; Solomon and Tobias, 1950; 
Tobias, 1950), there exists evidence to support the view that me- 
chanical motion is a concomitant of electrical activity in nerve tissue. 
A mutual coupling between electrical and mechanical changes can be 
realized, for example, through a piezoelectric or electrostrictive mech- 
anism. 

It will be demonstrated in the succeeding analysis that the ve- 
locity of propagation of a transverse disturbance along a cylindrical 
membrane might be identified with the velocity of propagation of 
the conduction spike of a thin-sheathed nerve fiber. Numerical val- 
ues of the requisite parameters are obtained during the course of 
the analysis. The picture developed is also in accord with other as- 
pects of the conduction process. However, since associated electrical 
phenomena have not been incorporated into the theory the results 
should be regarded as suggestive in nature at the present time. 

In order to develop a picture in which mechanical disturbances 
play a major role in the determination of the characteristics of the 
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conduction process of nerve tissue, it appears necessary to consider 
disturbances of a type which can be localized or guided along a cylin- 
drical membrane. It does not appear feasible to develop a theory on 
the basis of longitudinal acoustic waves. Such waves would not be 
guided either along or inside a tube if the values of the acoustic para- 
meters characterizing the external fluid are close to those of the in- 
ternal fluid, when the tube has the properties of a thin membrane. 
Furthermore, the velocity of propagation of longitudinal waves in a 
liquid medium such as water is about 1500 meters per second. This 
is of the order of 50 times greater than any observed conduction ve- 
locities in invertebrate nerve fibers. Since the density of the material 
present in the nerve is in the neighborhood of 1 gram/cm* the com- 
pressibility of the medium would have to be very much different from 
that of water solutions. Assuming that one could surmount these 
limitations, the variation of the conduction velocity with the radius 
of the fiber would lead to further complication. 

It is possible to free one’s self from all the above difficulties by 
developing a picture on the basis of the transverse vibrations of a 
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Ficure 1. (a) A transverse disturbance on a cylindrical membrane of cir- 


fan cross section. (b) Details indicating forces on an element of the mem- 
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tubular membrane enclosing a liquid and embedded in an external 
fluid medium. A possible role of shear waves in the nerve conduction 
process has been considered by W. Sutherland (1905, 1906-07, 1908- 
09). The investigations of O. Frank (1926) on the vibrations of the 
walls of blood vessels are of interest in this connection, but are not 
sufficiently general to include the analysis presented here. 

We consider a cylindrical membrane of circular cross section as 
illustrated in Figure la. The membrane is under tension, of magni- 
tude T per unit length, perhaps as the result of an excess internal 
pressure. Its mass per unit area is designated by o.f The density 
and acoustic velocity of the internal fluid are represented by p; and 
c; , respectively. The symbols p, and c, designate the corresponding 
quantities for the external fluid. We are concerned with the equa- 
tions appropriate for discussing transverse vibrations of small am- 
plitude on such a membrane with the associated disturbances in the 
surrounding media. It is sufficiently general for our present purpose 
to restrict the detailed discussion to a symmetrical disturbance, that 
is, one independent of the angle @ (Fig. 1). 

Let us consider a small section of such a membrane as illustrated 
in Figure 1b. Let € represent the displacement of the membrane in 
the radial direction and let A P be the difference between the internal 
and external pressures, i.e., 4 P = P’;, — P’-.. Then the forces act- 
ing on this element in the radial direction are 

: ; ; ~TA eS 
(i) Tr, APsinay +T 1; +az, AO SiN doo = 0 r( aba ). Az 


over the two arcs at Z and % + 42; 

(ii) ~--—T A2A9 

over the two elements parallel to the z axis, and 
(iii) APrAIGAZ=(Pw—P'o) PAIAZ 


because of the difference in pressure across the membrane. The dy- 
namical equation then follows: 


#E 1 | PotPeo_o #E 


— —— + — =. 1 

i T T ot i) 
Now the difference P’i) — P’eo can be written as 

P’9 — P’eo = APo + Pio — Peo ‘ (2) 


+For list of symbols and their definitions see page 314. 
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where P;, and Po are the deviations in pressure from the equilibrium 
values in the two media at the surface of the membrane caused by the 
disturbance on the membrane, and AP, is the pressure difference at 
equilibrium. The relation between the tension 7 and the equilibrium 
pressure difference AP, is given by 


= pe ae (3) 
Expression (1) then reduces to 


GENE Pow Bones 
— + —_=——. (4) 
Q 2? T Ter 
The same differential equation (4) is obtained if the tension is 
maintained by forces other than those arising from a hydrostatic 
pressure difference. Under the assumption that the displacement of 
the membrane is small, the equation appropriate for discussing the 
disturbance in the fluid media is the usual acoustic wave equation. 
We tabulate it here expressed in the symbols for the internal and 
external fluids: 


2 
v2 P,= 1¢ is 
Cr Ce 
(5) 
2 
vw P= 19 sib 
Ce Ont 


where P; and P, are the pressure deviations from the equilibrium val- 
ues at any point in the media. The effect of viscosity is neglected. 
Usually the inclusion of viscous terms affects the expression for ve- 
locity only slightly and introduces an amplitude damping factor. An 
indication of the magnitude of the damping, resulting from viscosity, 
for small amplitude disturbances of the type discussed might be given 
by the damping factor of an acoustic disturbance of the same fre- 
quency. On the basis of observations at ultrasonic frequencies, it 
appears that the damping factor for the frequencies and path lengths 
involved here would be small. Since the membrane is circular in 
cross section, we express the Laplacian operator in cylindrical coor- 
dinates. (See, for example, Morse, p. 296.) It is convenient first to 
discuss solutions to (4) and (5) which are sinusoidal functions of 
time. We write 
§& = Ey efvts 
Pi= Pi, ec; (6) 
P. — Pea eset . 
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The expressions (4) and (5) then become 
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A solution to the equations (7) which satisfies the requirements that 
(a) it approach zero as r approaches infinity and (b) there be no 
singularity at r = 0 is the following: 


Ey =a e-ske* f 
Pea dG | ee a (G04) 2). 7 Le (8) 
Pea — A, K| (#&,? — (w/e.)?)?r] ei" , 


where a, A; and A, are constants to be related to one another through 
the boundary conditions; J, and K, are Bessel functions of zero order 
which are especially suited to the problem since k; > w/c; or w/Ce. 
This latter relation is obtained as follows, kz, = w/Vw where Vw is 
the phase velocity for a sinusoidal disturbance traveling along the 
membrane. We are interested in phase velocities which are, in gen- 
eral, much smaller than the velocity of sound in the fluid media. This 
determines the direction of the above inequality. 

In order to interrelate the amplitude constants and to determine 
the propagation constant k,, we must apply the appropriate bound- 
ary conditions at the surface of the membrane, that is, the radial ve- 
locities of the fluid media at r = 7) must equal the membrane velocity. 
Now the relation between the radial velocity, u,, and the pressure is 
(see, for example, Morse, 1948, chap. v; note the sign change due to 
the choice of e-** in place of e7’' of this paper) : 
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We obtain, therefore, the expression 
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Consequently, 
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But (see, for example, Watson, 1944, p. 79) 
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Upon substituting (12) into the first of equations (7), we obtain 
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Since the first equation of (8) is a solution of (13), we have the 
following implicit expression for k, 


bt ot ae aa 
(14) 
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The phase velocity V. can then be expressed implicitly as 
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As indicated above, we are primarily concerned with phase velocities 
which are much less than the velocities c; and c,. In addition, since 
o << pi, we can neglect the first term by comparison with the other 
two. For such situations, we can write (15) as 


es it 
"Tes Ue/Th),, + pe (Ko/Ki) , 1” 


where the argument of the Bessel functions is (w7/Vwo). Now the 
magnitude of this argument increases with increasing values of both 
7, and w. Let us consider, then, relatively large values as follows: 
T) = 2(10)2 cm, Vu = 2(10)? cm/sec and w = (10)* sec. The first 
value is the approximate radius of a giant squid axon, the second is 
the conduction velocity in the axon and the third value corresponds 
to a frequency component © 2(10)*. From (16) we then obtain the 
result that for equal densities of the interior and exterior fluids the 
value of the second term in the denominator is only about one-third 
the value of the first term. If the exterior fluid is then replaced by 
a fluid of very low density (gas) the phase velocity will change by 
less than 10%. For a value of w = (10)*, the phase velocity will 
change by about 2% for the same exchange. We can thus conclude 
that the density of the external fluid is of minor importance in deter- 
mining the velocity of propagation. 

If we restrict the range of w to values such that w < (10)°, then 
we can rearrange expression (16) in the following approximate form 


(15) 


Vo* (16) 


2 
Vout —_= a Do (17) 

2 pi 
We see that the phase velocity is proportional to the square root of 
@, i.e., the membrane behaves like an anomalously dispersive me- 
dium. In other words, the group velocity of a wave packet is greater 
than the phase velocity of the midfrequency component. In this case 
V, = 2V,. If we substitute from expression (8) for JT, we obtain 
the relation 
__ w ro? A Po 


4 
Vo i 


(18) 
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If the tension is the result of a mechanism other than a hydrostatic 
pressure difference it appears necessary to postulate 7 « rm. The 
quantity A P, in (18) is then replaced by a proportionality constant. 
Now if we assume that the quantity A P, is constant, 1.e., independ- 
ent of 7,, the result for the phase velocity is 


AP, \"4 
Vani at ( a) (19) 


The third equation of (8) indicates that the rapidity with which 
a disturbance drops off as one moves radially away from the surface 
of the tube is greater for the larger values of w. For example, if one 
inserts the values 7 = 2(10)2 cm, Vo. = 2(10)* cm/sec and wa = 
(10)* sect, one computes from the last equation of (8) that at one 
diameter distance from the membrane the amplitude of the disturb- 
ance is only 1/10 the amplitude at the membrane. 

For purposes of comparison with experiment, we are, of course, 
interested in the velocity of-propagation of a fairly localized disturb- 
ance along the length of the tube. We will show that as long as a 
signal velocity has a meaning for any particular disturbance, this 
velocity depends on the radius 7, in the same way that the phase ve- 
locity depends on r,. This is shown with the use of the Fourier 
transform. (See, for example, Guillemin, 1935, chap ii.) We choose 
an exciting pulse f(t) for the displacement, €, of the membrane at 
z = 0. Then this pulse can be represented in terms of sinusoidal 
functions of time by means of the integral relation 


f(b) = f{_ 9) et da, 
nehers mas (20) 


g(o) == fre e-fot dt, 


From the first of equations (8) we see that the disturbance as a 
function of time will have the following form at a point z to the 


right of zero (an identical relation in time is observed at z, units 
to the left of zero) 


Ne a Io ee Oem (21) 


where Sw”? has been inserted for V.. The quantity 6 is a constant. 
We now refer to equations (17), (18) or (19). If we consider the 
movement of some point of the disturbance, such as a maximum or 
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peak value Gt this is possible), and define a signal velocity with 
respect to this point, then, from (21), such a signal velocity, Vs, must 
satisfy the following relation: Vs < 6. We can thus write 


AP, \3¥4 
Vs= br ) (22) 

pi 
where 6 is a constant which is to be determined by evaluation of the 
integral (21). The relation indicated by (22) between the radius Ts 
and the velocity Vs is that observed experimentally. 
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FIGURE 2. (a) Periodic excitation function applied at z= 0. (b) The pro- 
pagated response as a function of position along the tube. The curve labeled a 
is the exciting function, curve c indicates the response at a point twice as far 
from the position of excitation as that indicated by curve b. 


We have not carried out an exact evaluation of (21) for any 
pulse shape. However, to obtain a value for the order of magnitude 
of the signal velocity, we carry out an approximate analysis for a 
particular exciting function. We assume that the exciting function 
is a periodic one compounded from four frequencies (multiples of 
1/56) of the form indicated in Figure 2a. Then we obtain for f(t) 
at the two points to the right of the origin z = ¢(2éd/2)? and 
Zoo = 2c (2d/2)/2, where c is related to the phase velocity of a compo- 
nent corresponding to w = (22/65) (0.8) by Vo = e[ (22/65) (0.8) 1”, 
the disturbances labeled (b) and (c) in Figure 2b. It is evident that 
one can identify a signal velocity if one does not consider points too 
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far removed from the point of excitation. In this particular case, the 
signal velocity is equal to about 0.9 of the phase velocity of the high- 
est frequency component present in the exciting function. A calcu- 
lation based on a value of this order, for a case corresponding to a 
giant squid axon, yields a value of the required pressure difference, 
AP,, of the order of several atmospheres. Pressures of this order 
of magnitude appear somewhat high. However, from formula (18) 
we see that a doubling of the value of w decreases the value of the 
required pressure by a factor of four. The range of values of w is 
suggested by the shape of the spike potential and the membrane con- 
ductance change with time (Cole and Curtis, 1939). 


SO 
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_ FIGURE 3. Comparison of experimental values for the velocity of conduc- 
tion versus fiber diameter with the theoretical result. 


The graph of Figure 3 indicates a comparison of theoretical re- 
sults with experiment. The experimental values are from the data 
tabulated by C. L. Prosser (1946) for crustacea and cephalopod mol- 
lusks. The straight line is drawn for a 0.5 power relationship, i.e., 
the quantity AP, in (22) is taken constant. 

Recent experimental work lends support to the view that me- 
chanical motion coexists with electrical changes taking place in nerve 
during the conduction process. Dimensional and opacity changes in 
nerve in the region of electrodes supplying polarizing current have 
been observed by J. M. Tobias and S. Solomon (1950) ; Solomon and 
Tobias (1950) and Tobias (1950). Their results considered with the 
work of Dz K. Hill and R. D. Keynes (1949) on opacity changes ac- 
companying propagated electrical activity suggest the possibility of 
mechanical motion during activity. The ease of stimulation of nerve 
tissue by mechanical means also lends support to such a picture 
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(Blair, 1936). 

Investigators (Curtis and Cole, 1942; Hodgkin and Huxley, 
1945; Hodgkin and Rushton, 1946) have found useful the concept of 
an inductive element in the equivalent electrical circuit of nerve. The 
magnitude of the required inductance is large, of the order of 0.2 H 
cm’. It is not necessary that an inductive element as such exist in 
nerve tissue. Indeed, one of such magnitude would be difficult to 
accept; mechanical motion, however, would manifest itself in the 
equivalent electrical circuit of the tissue as a relatively large induc- 
tive element. 

It is an immediate consequence of (17) and (20) that as the 
tension varies so does the signal velocity. Experimental results in- 
dicate that the conduction velocity may increase when a nerve is 
longitudinally stretched (Bullock, 1945). Such stretching would in- 
crease the tension, 7, in relation (17) and thus increase the velocity 
if the change in radius 7, does not over-compensate. It would be re- 
vealing to check the theory against precise quantitative data on this 
effect. The experimental evidence that lateral squeezing reduces the 
conduction velocity (Gasser and Erlanger, 1929) is a direct conse- 
quence of (17) since the tension, 7’, would be reduced in the region of 
lateral compression. This theory is not inconsistent with the results 
presented by H. Grundfest (1936) on the effects of hydrostatic pres- 
sure. No change in the conduction velocity would be observed on this 
picture if the difference between internal and external pressures 
remains constant. 

The theory presented in this publication is entirely mechanical. 
In order to correlate the electrical manifestations of the conduction 
process of invertebrate nerve with such a picture, it is necessary to 
postulate an electromechanical coupling mechanism. The mechanical 
motion might then serve as a sort of trigger for electrical “break- 
down.” In any event, such considerations await further investigation. 

It is concluded on the basis of the theory developed herein that 
mechanical motion may be a primary factor in determining the ve- 
locity of propagation of the conduction pulse of thin sheathed nerve 
fibers. 

We wish to thank Dr. V. J. Wulff for his interest and for help- 
ful discussions. 
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membrane surface 

radial coordinate 

radius of cylindrical membrane 


> 
Fy 


r 
T tension per unit length in the membrane 
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Bessel functions order zero and one, respectively 
27f where f is the frequency 

width of exciting pulse at half amplitude 

angular coordinate 

radial displacement of membrane 

a radial displacement amplitude of membrane 

Pir Pe densities of internal and external media, respectively 
o mass of membrane per unit area 
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The input-output formula is derived for a neuron upon which con- 
verge the axones of two other neurons (one excitatory, the other inhibi- 
tory) which are themselves subjected to a “Poisson shower” of excita- 
tory stimuli. If the period of latent inhibition, 7, does not exceed one half 
the refractory period, 5, the input-output curve has no maximum. If, 
however, ¢ > 6/2, a maximum exists in the input-output curve. As the 
outside frequency x increases without bound, the output frequency Xz 


approaches an asymptotic value which ranges from 1/6 to 0, depend- 
ing on the ratio o/6 . The maximum output (if it exists) is also derived 
as a function of o and 6. 


In previous papers, I and II of this series (Rapoport, 1950a,b), 
we derived input-output functions for a single neuron receiving a 
shower of completely randomized (Poisson-distributed) stimuli. In 
most examples treated in those papers the stimuli were supposed to 
come from outside the organism. However, in the case of inhibition, 
where some of the stimuli were supposed to be excitatory and some 
inhibitory, it seemed too artificial to suppose that the shower of out- 
side stimuli could be subdivided into two “‘sub-showers,” one excita- 
tory, one inhibitory. We supposed, therefore, that a homogeneous 
shower of stimuli impinged upon two neurons, and that it was these 
neurons which produced respectively an excitatory and an inhibitory 
effect upon a third neuron. The output of the third neuron as a func- 
tion of the outside input was then derived. 

This model was interesting chiefly because of the maximum 
which characterized the output curve of the third neuron with re- 
spect to the outside input. It was shown in I that the existence of 
such maxima allowed the construction of “filter nets” in which cer- 
tain neurons would respond only to certain ranges of the outside in- 
put frequency. Moreover, these ranges could be narrowed to any 
desired degree by simply raising the thresholds of the neurons. Thus 
a counterpart of a resonance phenomenon could be accounted for in 


a neural net. 
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The weakness of the filter net model discussed in I lay in the 
drastic simplification of neglecting the refractory period. This al- 
lowed the problem to be treated by quite simple mathematical means. 
In the present paper, it will be supposed that neurons have a finite, 
constant refractory period 6. The inclusion of a refractory period 
complicates the mathematical treatment considerably, as will be seen, 
since the time distribution of firings of the neurons is no longer a 
Poisson distribution. Moreover, the existence of a maximum in the 
output (and hence the possibility of using the model for constructing 
“filter nets”) will now depend on a certain relation between the re- 
fractory period and the period of latent inhibition. 


FIGURE 1 


Consider a net such as is shown in Figure 1. The firing condi- 
tions for such a net in the McCulloch-Pitts notation (McCulloch and 
Pitts, 1943) would be, under the assumption of quantized time, where 
the synaptic delay is taken as the unit of time, 


Ns (le ENG CG el ee aCe ie (1) 


We shall retain the synaptic delay (assumed constant for all 
neurons) as the unit of time. However, we will not demand that fir- 
ings occur only at moments which are all multiples of the synaptic 
delay measured from some common origin, as is supposed in the 
above-mentioned paper of McCulloch and Pitts. Now if 6 designates 
the refractory period, and o the period of latent inhibition, we may 
rewrite equation (1) as follows: 


N;(t) iy ete [emsco -N;(é— 71) ] 


li 


“iri oo N (Ces =e oo 
That is to say, “N; fires at time t, provided N, has fired at time 
(¢ — 1), and provided N; has not fired within the interval 6 before ¢ , 
and provided N. has not fired within the period of latent inhibition 
prior to the arrival of the impulse at N; from N, .” 

We are interested in the output of N; as a function of the aver- 
age frequency « of the outside stimuli. Evidently, the output will not 
be the same as that derived in I, where the refractory period was 
neglected. In that case, the stimuli received by N; from Nz became 
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infinitely frequent with increasing 2, so that almost all of the im- 
pulses from N, were “canceled” by stimuli from N.. The limiting 
output x; of N; as « increased without bound was zero. As we shall 
see, this will not be the case with the present model. 

Suppose « increases without bound, so that both N, and N, fire 
with their limiting frequency 1/5. Since N, and N, are independent 
of each other, there is no way of knowing how their firing patterns 
will “lock” in the limiting frequencies. If they lock so that the fir- 
ing of N, follows that of N. at less than o units, all of the N, stimuli 
will be canceled by N., and N, will not respond at all. On the other 
hand if N, follows N. at more than o units, N. will have no effect, 
and N; will respond every time, namely, with the frequency 1/6. 

Thus a definite input-output curve for N; cannot be derived on 
this basis. If, however, instead of a single neuron N» we consider a 
whole aggregate of such neurons, then the average output intensity 
(frequency of firing per neuron) can be computed.t This problem 
will be considered here. 


Firing Distributions of N, and N.. 


We shall introduce a certain distribution function, which is a 
generalization of an analogous function in the case of the Poisson 
distribution. For a Poisson distribution of a sequence of events, the 


function 
P(t) =2xe* (3) 


represents the probability distribution with respect to the time of 
occurrence of the next event, where the origin of the time coordinate 
can be chosen arbitrarily. Evidently this distribution does not hold 
if the “events” are the firings of a neuron with a finite refractory 
period, since the choice of origin does make a difference in this case. 
If, for example, the origin is chosen exactly 6 units after a firing 
instant, then, since the neuron is now responsive to every incidence 
of outside stimulation, the probability distribution of the time of the 
next firing is indeed given by (38). But if the origin is chosen at a 
firing instant, then, obviously P(t) = 0 for t < 6, so that (3) does 


not hold. 


+Strictly speaking the same reasoning can be applied to a single neuron if 
the frequency is averaged over sufficiently long periods of time, since there are 
bound to occur “shifts in phase” in the firing patterns due to random fluctua- 
tions in the instantaneous frequency of the outside shower. Such long interval 
averages seem, however, to be of slight physiologica] interest, if our definition 
of the firing frequency of a region involves a more or less instantaneous fre- 


quency (short interval average). 
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We wish to compute the probability distribution of the time of 
the next firing for N, for the case where the choice of origin is wun- 
known. For this purpose we introduce a function which represents 
the probability distribution of the time of the next firing, where it 
is supposed that the last firing occurred at t = —y. This function 
is given by 


Oly, t)—O0 Or 0st = 0-— Yay or 
p(y, t) =x Exp{—a(t—o+4)} fort ~ oy ee 
o(y,t)=xe*"* for y2Zo. 


The function we seek is P(t), the “expected” form of p(y,t?), 
i.e, its “average” with respect to all possible choices of y. Since 
the probability of the time of occurrence of the “next” event is the 
same whether time is measured forward or backward, if the choice 
of origin is not known, it follows that if we choose an instant at 
random, the probability of choosing an instant between y and y + dy 
following a firing is P(y)dy. Hence to get P(t) we must multiply 
p(y,t) by the probability of y, P(y)dy, and integrate over the en- 


tire range of y. This leads to the following integral equation for 
feta 


xe) (oe) 
P(t) = P(y)e Exp|—a(t—o+ way + f P(y) xe" dy S65) 
6 


6-t 


L6rt 0° 
6 | 2 

P(t) = | P(y)2 Exp|—e(t—0 + lay + f, Pmer ay ©) 
0 6 


1OtsGo D 


At t = 6 the two expressions are identical, so that P(t) is a 
continuous function of ¢ everywhere. Equation (6) can be immedi- 


ately reduced to a differential equation by differentiating both sides 
with respect to t. In fact, 


P(t): aP it) osfory aes (7) 
which upon integration yields 
P(t) =Ae**, (8) 


where A is independent of t (is a constant of integration to be com- 
puted below). 


For ae the situation is somewhat more involved. Differen- 
tiation of (5) with respect to t gives a differential-difference equa- 
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tion, 
P(t) =—a«P(t) + «P(6—#), for tsé. (9) 


We note that if P(t) is taken independent of ¢ in the interval 
0 =t = 6, equation (9) is satisfied. This suggests that we seek a 
function of « and 6 alone, which would satisfy (5). Moreover, such 
a function must also satisfy (6), since the first integral on the right- 
hand side of (6) involves P(y) in the interval (0, 6). 

Let us first compute P(0). The first integral on the right side 
of (5) vanishes, and we have 


P(0) = [Pama ; (10) 
6 : : 


But the form of P(y) for y > 6 is given by (8). Hence 
P(0) = Ae*6, (11) 


Similarly, we compute P(6) and find it to be also equal to Ae*>. This 
suggests that P(t) might be Ae in the interval 0 = ¢ S 6, and it 
remains to compute A. This quantity can be easily determined by 
the normalization of P(t). We have 


2 co re) ) 
| P(tyat= { Ae-8dt + J Ac*tdt=1; 
ts) 0 fe) 


. (12) 
d6Ae* + Ae*/a=1, 


whence 
A = xe (wd + 1). (13) 


Therefore, we guess the solution of the integral equation (5) and 
(6) to be 
P(t)=2(#6+1) for 0St26; 


(14) 
P(t) =x Exp {—2(é— 6)}- (26 —1)77 


fort. 0: 
P(t) 


P(t)=x(xd+1)7! 


1 

1 

1 

1 

1 
t=d 
FIGURE 2 
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Substitution of (14) in (5) and (6) reduces the latter to iden- 
tities. Hence, by the uniqueness of the solutions of (5) and (6), 
P(t), as given by (14), is the function sought. The graph of P(t) 
is shown in Figure 2. 

We are now ready to compute the output intensity of an aggre- 
gate of neurons of the type N,. If an arbitrary instant is chosen, 
the probability that N. has not fired for o units prior to that instant 
is given by 


OG aT f Pat. (15) 


There are two cases, o < 6 and 6 < o. The first case seems biologi- 
cally more plausible,t but for the sake of completeness, we shall 
treat both cases. 


Case: 0 < 6. 


A necessary condition for the firing of N; in an infinitesimal 
interval (¢, ¢ + dt) is that it receives an impulse from N, and has 
not received one from N, for at least o seconds prior to t. We must 
show that the condition is also sufficient. But the only thing that can 
prevent Nz; from responding to a stimulus received from N, which 
has not been “canceled” by a stimulus from N, is the possibility that 
the stimulus from N, has fallen within the refractory time of N;. 
But if this happens, it means that N; has received two impulses from 
N, within less than 6 seconds, because N; can be fired only by N,. 
But this is impossible because of the refractory period of N,. There- 
fore any stimulus coming from N, cannot fall within the refractory 
period of Nz, and the above-mentioned condition is sufficient for N; 
to fire. The independence of N, and Nz now allows us to write down 
the probability that N; fires in any given infinitesimal interval dt, 
provided o < 6. This is 


xdt = x(xd +1)71Q(c)dt=24(x2d + 1)2 
» [Lb ortae 1) lore 


The average frequency of firing of N; is obtained by dividing the 
right side of (16) by dt. 
If the output is to have a maximum, then the equation 
dis — uf Sat 20% es 
de  (w6+1)? (#6+1)3 — 


+See, however, the remarks in this connection in IV (Rapoport, 1950d). 


(16) 


0 (17) 
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must have a solution for a positive value of «. Equation (17) indi- 
cates that the maximizing value of the input is given by 


x* = (26 — 6), (18) 


which is positive only if ¢ >d/2. There is thus a lower limit, namely, 
6/2, which o must exceed in order for the aggregate of neurons of 
the type N; to exhibit a maximum in the output. 

If we designate by x, the average frequency of N, (which by 
symmetry is also that of N.), we can easily compute 2, , the value of 
x, which maximizes x;. This turns out to be independent of 6 and is 
given by 


Te (2o)e, Ore 0 = or 0/ 2. (19) 


The maximum value of x; is likewise independent of 6 and is 
given by 
6; = (4c). (20) 
Thus whenever the average frequency of the neurons N; is maxi- 
mized, it is just one half the frequence of the neurons N, and 
N.. We may also compute the limiting frequence of N; as % in- 
creases without bound. This does depend both upon 6 and «. Since 
Lim x, = 1/6, we have 


L200 


Lim #,; = 1/6 — o/6*. (21) 


@->00 


If « < 6, this is positive and approaches 0 as o approaches 6 (cf. IV). 


Case: o > 6. 


We now have 


Ox a5 
== + Ex 
Jeo v6 +1 | Toc yi 


6x __ Exp 10 ( 0a) pk (22) 
COL xo +1 
Exp {x (6 — o)} 


ro +1 


—x(t — | dt 


— 


——— 


Then 
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Q(o) =1— ['P(dt=Exp| 2(o—0)| (ois) s+ (23) 
0 


and 
== 2 Exp {7 (6— 6) (20s (24) 


Since 6 — o < 0, it is evident that Lim 7, = 0. 


la 


X3 


X3=% 


FIGURE 8 


To obtain the maximizing input, we set 


dx; 
dx 
(25) 

__ (@6+1) [Exp{a (6—c) } (1+%6—ae) ]—26aExp{x (6—c) } 0 
a3 (x5 +1)4 
and after simplifications obtain 

x? (6? — da) — a0 +1=0, (26) 
which yields 

< Vent We = dps 
._..¢= No o (27) 


26 (6 —o) 

Discarding the negative root, we obtain for the maximizing input 
~_. Va? + 460 — 462 —o 

25(«— 6) : 


Note that the limit of x* as a approaches 6 is 1/5, as should be the 
case. 


(o>6). (28) 
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The determination of x," and x,° is straightforward. We have 
wy" = (Vo? + 400 — 46? — o) (Vo? + 460 — 462 +o — 25) (29) 
wea oo) Cae ie) o— Vo? + 4d0 — 462 
—_— hw — ss ee x 


aes Sears (00 
(Vo? + 460 — 40? + « — 26)? 26 ey 


* 
3 


This investigation is part of the work done under Contract No. 
AF 19(122)-161 between the U. S. Air Force Cambridge Research 
Laboratories and The University of Chicago. 
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Input-output formulas are derived for a neuron upon which con- 
verge single axones of two other neurons, which are subjected to a 
Poisson. shower, where a number of different assumptions are made 
concerning the mechanism of inhibition. 

In one assumption so-called “bilateral pre-inhibition” is considered. 
That is to say, both neurons N. , and N, may excite N,, but if the stimu- 
lus of one of them follows within a certain interval o of the other, the 
second stimulus is not effective. This model is essentially no different 
from that involving two excitatory neurons acting upon a neuron hav- 
ing a refractory period. 

Another mechanism considered involves so-called “pre-and-post’’ in- 
hibition, in which if two stimuli from N, and N, fall within o, both 
are ineffective. This case being mathematically much more involved 
than the preceding, an approximation method is used for deriving the 
input-output formula. 


It is assumed in the so-called two-factor theory (e.g., Rashevsky, 
1948; Householder and Landahl, 1945) that neuroelements may be 
specifically characterized as either excitatory or inhibitory, that is 
to say, the action of some terminal bulbs enhances the probability 
of firing of the cell body they impinge upon, while the action of 
others, on the contrary, tends to counteract this. The discussions 
based on the two-factor theory have usually been so worded as to 
imply that the respective actions of excitatory and inhibitory bulbs 
are essentially different. 

Actually it is not necessary to assume this. W. S. McCulloch 
has pointed out (in a lecture, May 25, 1950) that it is possible to 
account for a specific inhibitory effect of one newron upon another 
without postulating any specific inhibitory action of its end bulbs. 

Consider the net shown in Figure 1. The neuron N, fires if si- 
multaneously excited by N. and N; (that is, the period of latent addi- 
tion is zero). If N; does not fire, the firing of N; is sufficient to fire 
N, via N2 and N;. If, however, N; fires within 6 units prior to Ni, 


+Previous papers of this series are denoted by I, II, and III in this paper. 
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FIGURE 1 


where 6 is the refractory period of N;, then Ns will not be able to 
respond to the stimulation by N,, since the stimulus from N, arrives 
at N; within its refractory period. But since N, has threshold 2, N2 
is not sufficient to fire it. Essentially, then, N; does produce an in- 
hibitory effect upon N, with a period of latent inhibition equal to 6, 
the refractory period of N;. 

If the refractory period of N, does not exceed 6, it is seen from 
this model that it may be useful to consider cases in which the period 
of latent inhibition exceeds the refractory period (cf. III). 

Note that although the McCulloch model obviates the necessity 
of postulating specific inhibitory action at the end bulbs, it is never- 
theless formally equivalent to the two-factor theory models. The 
computation of the input-output curve of N. with respect to the fre- 
quency of Poisson-distributed stimuli impinging upon N, and N; 
would proceed exactly as in III where o would now be the refractory 
period of N; and 6 the refractory period of N,. 

As in the case discussed in III, it follows from the model repre- 
sented in Figure 1 that 

1) the action of neurons is specific (either inhibitory or exci- 
tatory, e.g., N, always tends to excite N., while N; always tends to 
cancel the action of N;); 

2) to effect inhibition, the inhibitory stimulus must arrive be- 
fore the excitatory stimulus which it is to “cancel.” 


N,O-——— () ———On 


FIGURE 2 
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It is, however, possible to generalize the picture still further by 
dropping assumptions 1) and 2). In fact, let neurons N, and N, 
converge upon N; as in Figure 2. 


The firing equation assumed in III was 


N;(t) p=" NG (61 eels 0 "= N:(t—7)] 
(1) 


. [t2 <o c= N2(t—7r.2—1). 


But we may assume instead of equation (1) any of the following 
equations 


N;(¢) -=" N,(t—1) [mn <6°= N3(t—73] 
. ph hae Soe (2) 
[tr2 <0 '= No(t + 72—1)] ; 
N;(t) “=: (Ni(t—1) * [n <6 *=" Ni (t—1)] 
. [rm <o ‘=: No(t—7—1)]}, 
(3) 


v{N.(¢—1) ¢ [11 <6 -=- N,(t—71)] 
[rm <o = Ni(¢—72—1)]}, 


and still others where the expressions N;(t — tm — 1); are replaced 
by Ni(é + w— 1) or by Ni(ét = 72 — 1); (6 = 1,2). It is assumed 
throughout that o < 1, i.e., does not exceed the synaptic delay. 

Equation (3), for example, implies that either N, or N. may ex- 
cite N;, but that either N, or N. may cancel the stimulus of the 
other, provided its stimulus arrives within o (period of latent 
inhibition) units before the stimulus of the other. Replacing 
Ni(t — to — 1) by Ni(t + 72 — 1) implies that inhibition can occur 
if the inhibitory stimulus arrives simply within o units of the ex- 
citatory stimulus and may either precede or follow it. The term 
Ni(t + 7 — 1) requires that the inhibitory stimulus follow the ex- 
citatory one (provided, of course, that it arrives within the synaptic 
delay, which is guaranteed by the condition, o < 1). 

Underlying each of the equations (1)—(3) and the alternatives 
suggested above is an interpretation of its possible “physical signifi- 
cance.” Thus the model described by equation (2) may be the fol- 
lowing: Firing takes place due to an accumulation of “excitatory 
substance.” Inhibition consists in the destruction or neutralization 
of the excitatory substance but not by an accumulation of a specific 
“inhibitory substance.” Thus, to be effective, the inhibitory stimu- 
lus must follow the excitatory one, since prior to the excitatory stimu- 
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lus there is no excitatory substance to neutralize. 

Taking equation (3) with the terms Ni (¢ — 72 — 1) replaced by 
N(t = 7 — 1), we may interpret it as follows. Firing takes place 
as a result of an increased concentration of ions (accumulating dur- 
ing the synaptic delay) within the cell body near the point of excita- 
tion. If the bulbs of N, and N, are at such a distance from each 
other as to “compete” for the available ions, the (nearly) simultane- 
ous activity of both neurons may preclude a sufficient concentration 
at either connection. Thus each neuron inhibits the other and the 
“inhibitory” stimulus may precede or follow the other. 

Which of the equations (if any) is the most useful depends, of 
course, on the actual physical events associated with excitation and 
inhibition. In our formal treatment we will leave the question of 
physical significance open and will only work out the consequences 
of some of the assumptions. Equation (1) may be considered as rep- 
resenting “unilateral pre-inhibition,” i.e., only N» inhibits and does 
so only if its stimulus arrives prior to that of N,. Equation (3) rep- 
resents “bilateral pre-inhibition” (both N, and N. may excite or in- 
hibit, and the inhibitory stimulus must come before the excitatory). 
Entirely analogously we may represent unilateral and bilateral “‘post- 
inhibition” and unilateral and bilateral ‘“pre-and-post-inhibition,” 
which, to avoid a clumsy term, we shall designate as simply “‘inhibi- 
rons: 


Bilateral Pre-Inhibition or Simple Summation. 


In the case of bilateral pre-inhibition N; can be fired by either 
N, or N, provided it had not fired for 6 seconds previously and pro- 
vided it had not been inhibited for « seconds previously. The neuron 
N; remains unresponsive for 6 units after firing. If 6 > o, then, 
after the refractory period has passed, N; can be fired again by 
either N, or Nz. On the other hand if 6 < o, then Nz remains re- 
fractory for o units after having been fired by N, or N..-In either 
case, only one of the parameters (the greater of 6 and os) plays a 
part. The case of bilateral pre-inhibition thus reduces to one of sim- 
ple summation of frequencies modified by a refractory period. That 
is to say, N; responds to a stimulus from either N, or N., provided 
the stimulus does not fall within the refractory period of N;. For 
simplicity, the refractory periods of all three neurons will be taken 
equal to 6. 

If Ns; were responding to two superimposed Poisson showers, 
each of frequency « , then the response of Nz would simply be 
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20 


ae ZO 


AX 
(4) 


in accordance with the considerations presented in I. However, the 
stimuli impinging upon N; from N, and Nz are not Poisson-distribut- 
ed, and we cannot simply take the sum of their frequencies for the 
input of N; . 

We shall first compute the function (6, t), the probability dis- 
tribution of the time of the next firing of Ns, when the origin has 
been chosen at a moment 6 units after a firing. From this the ex- 
pected time of firing can be computed. Adding to this expected time, 
the refractory period 6, one obtains the expected time between two 
Successive firings and hence the average frequency. 

It is evident that p(6,t)dt is the probability of receiving the 
first stimulus either from N, or from N. during the interval 
(t, t+dt), it being known that a stimulus was received from one of 
the neurons at the instant (—6é). 

Suppose for definiteness that the last stimulus was received 
from N,. Then, since the refractory period of N, has passed, the 
probability distribution for the time of the next stimulus from N, 
will be given by xe’, which is simply the probability distribution of 
the time of the next stimulus from the outside. Nothing, however, is 
known about the state of N» at our origin. Therefore, the distribu- 
tion of the time of the next firing of N. must be given by equation 
fit) 01 1 tl, i.e., 


t(toe 1). s tory. 0 = t=0; 
x Exp{—«(t—6)} «(ad +1)7, fort>o. 


Now the probability that the first stimulus after t = 0 received 
sitions may be considered mutually exclusive, since the probability 
that either this stimulus came from N,, and no stimulus came from 
N. in the interval (0, ¢ + dt), or that it came from N.2 and no 
stimulus came from N, in the interval (0, t + dt). The two propo- 
sitions may be considered mutually exclusive, since the probability 
of simultaneous arrival of stimuli in the interval (¢, ¢ + dt) is an 
infinitesimal of the second order. We thus obtain 


t t 
adt x 
=gert(1— | ———)+ 1— mal 6 
p(6,t) =e (1 [#4) T+ [= (6) 


Tort. =.0; 


(5) 
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al 
ae ale? adt [| «Exp i{—#(t—9)} 
p (3,1) =2e*t| 1— _wtl |. xd +1 


(7) 
4 x Exp {—x(t — 6)} = edt | 
TO a1 j 
LOM tee ton 
After performing the indicated integrations, we get 
2% + 02(6—T) 
t) =e | ————_ , 40f) -f =; 8) 
p(6,t) =e eA | ( 
2a Exp {—a« (2t — 6)} 
0 (6, t) = ———__,, for #20. (9) 
TOs 


For t = 6, the two expressions (8) and (9) coincide. 
To obtain the expected time of firing from our origin, we evalu- 


ate i) tp(6,t)dt. The integration is laborious but straightforward. 
0 


In the interval (0, 6) we obtain 


6 
6(1 — e*) 
tp (6, t)dé = —_———___, 10 
i p(o,t) ey (10) 
while in the interval (6, 0), we have 
i (205 + 1) e*6 
tp (6; §) d§é= ——_—__——.. 
il DAO eb) 24 (80 421) Ste 


Hence the expected time of the next firing from our origin, which, it 
will be recalled, was chosen 6 units after firing, will be 


} pgeteorces 


Finally, the expected time of firing from a moment of firing is evi- 
dently 


2x76? + 4x6 + e-*6 


E(0) = = 
Dae 2x (76 + 1) 


(13) 
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The average frequency of a3, being the inverse of the expected time 
between two successive firings, will then evidently be 


ae 2a (0-1) 
* 2228? + dad + €*8" sna 
It is seen that for very small values of a, x; behaves like 2x , while 
for very large values, it approaches 1/6 asymptotically, as should be 
the case. 


An Approximation Method. 


The problem of bilateral pre-inhibition, which, as we have seen, 
reduces to a simple summation of frequencies modified by the re- 
fractory period (assumed equal for all three neurons) can also be 
attacked by an approximation method. We shall use this method in 
the considerably more complicated case of bilateral inhibition (pre- 
and post-inhibition). Having solved the simpler pre-inhibition prob- 
lem by both methods, we shall be able to estimate for that case the 
accuracy of the approximation method. As we shall see, the dis- 
crepancy does not exceed 8% in the simpler case. Hence, it is at 
least plausible that the method is useful in the closely related, but 
more involved, case of bilateral inhibition. 

Consider all the stimuli received by N; from N, and N. (whether 
effective or not) distributed as points on the time axis. Because of 
the refractory period of N;, only a fraction of these stimuli will be 
able to fire N;. Let us now determine this fraction @. 

To do this, pick an arbitrary stimulus and ask what is the prob- 
ability that it is “effective,” that is, that it fires N;. This is evidently 
the case if, and only if, an effective stimulus has not been received 
for 6 units prior to it. To determine the probability that no effective 
stimulus has been received within a given interval of time, we must 
know the probability distribution of the time of its first occurrence 
from a given origin. 

Essentially, the “exact” method above was based on the deriva- 
tion of such distributions. In our approximation method, we suppose 
that the rectangular distribution which we have derived for the first 
occurrence of any stimulus within 6 units of an arbitrarily chosen 
moment is simply modified by a constant factor to give the distribu- 
tion of the first effective stimulus. This constant factor is evidently 
approximately 6. 

We thus have the following equation for the probability that a 
given stimulus is effective. 
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6 
adt 66x 
oa of go +1 TO aie 


which, solved for 6, gives 


5 al 
— On6 +1 


(16) 


Multiplying this probability by the frequency of all stimuli received 
by N;, that is, 2x(a#6 + 1)-, we obtain for the approximate formula 
of the average frequency of N; 
2¢ 2x (xd + Le: 
Ove +1 2476? +3826 +1° 


“3 (17) 
The last expression of (17) has been put in this form to show its es- 
sential similarity to the “exact” formula (14). Note that (14) and 
(17) agree for both small-and large values of x. Maximum discrep- 


ancy between them has been shown by S. Levinsohn (1951) not to 
exceed 8%. 


Bilateral Pre-and-Post-Inhibition. 


We shall now apply our approximation method to bilateral in- 
hibition symbolized by the following equation: 


N;3(t) *=: {N.(é— 1) *[11<6°=: N3(é—7:1)] 


[re < oo ==" Nat eee) hes 
(18) 
Vv {Ne(t—1) Sites: N;:(t— 71) ] 


*[m<o°= N, (f+ 2—1)]}. 


That is to say, both N, and Nz can excite N, and each can cancel 
the other’s stimulus both before and after it is received. In other 
words, if stimuli from N, and N, arrive within the period of latent 
inhibition o , both are canceled. 

Again, as in the preceding example, consider all the stimuli re- 
ceived by N, from either source (whether effective or not) as a set 
of points distributed on the time axis. Clearly, the average density 
of these points will be 2x(a6 + 1)-. But only a fraction of the 


stimuli will be effective. We proceed to determine this fraction 6. 
We shall suppose that « < 6/2. 
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Note that an arbitrarily chosen stimulus received by N; is ef- 
fective if, and only if, it is not preceded by an effective stimulus for 
6 units and is not preceded or followed by any stimulus for o units. 
However, the two conditions are not independent, and the probability 
that both hold simultaneously cannot be obtained by multiplying the 
probabilities of each of the conditions together. Therefore, we in- 
troduce the probability Q(6, 20) which is defined as the probability 
that no effective stimulus precedes a given moment, where it is known 
that no stimulus of any kind occurred for o units before and after 
that moment. Now the probability that a given stimulus is effective 
ean be obtained by taking the product of the following two probabil- 
ities: 1) the probability that no stimulus of any kind precedes or 
follows the stimulus in question within the interval o; and, 2) the 
probability Q(6, 2c) defined above. As in the preceding case we 
again take a rectangular distribution for the first probability. Thus, 


2x0 

G=1 ee Meee) (19) 
It remains to compute Q(6, 2c). Essentially the computation in- 
volves the determination of P(t) as in equations (5) and (6) of 
III, except that the origin is now not entirely arbitrary since it is 
known that no stimuli have been received for o units before and 
after. The procedure will be equivalent to that followed in III if we 
choose our origin at (—c), measure time backwards, and suppose 
that no stimulus was received in the interval (—2c, 0). 

Then P(t), as given by the integral equations below, will repre- 
sent the probability distribution of the time of the first impingement 
of an effective stimulus (actually the probability distribution of the 
time of the last firing) 


ei) : P(y)« Exp |—#(t—9 Sus w)}ay “ [ Paeeray; (20) 
6 


6-t 


Rothe 0 — oo, 
6 200 
P(t) = { P(y)e Exp|—2(t—3+y)|dv + f P(y)e“'dy,, (21) 
20 6 


fori > 6 — 2c. 


Equations (20) and (21) yield solutions analogous to those obtained 
in IiI, namely, 
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Ab 
she lS ee ee eee 22 
EAL) (5 aoe eee z (22) 
Exp ee ore 
(tye 2) gostei esan (23) 
¢:(6.— 20) ae 


Since our origin was chosen at (—o) relative to the arbitrarily chos- 
en stimulus, the probability that the first effective stimulus was re- 
ceived anywhere within 6 units from ¢ = 0 (it being known that 
none was received in the interval (0, — o)), will be given by 


6-0 6-o 
Be oe Exp {—2(t—6 +2 
pi 22), [Eat )} 
‘ tO Zo eae Se x(6—2c) +1 
rk oe) 1 — e*" Ce 
x(d—2c) +1 aw4(d—2c) +1 
From (24) we obtain Q(6, 2c), the probability that no effective 


stimulus was received in the interval (— 6, 0), it being known that 
no stimulus of any sort was received in the interval (—o, «): 


Q(3,2c) =1—6 f[P@at 


_ (ud = Bite + 1—e**) ee 


x (6 — 2c) = cam I 
Substituting (25) into (19), we obtain the equation for @ , namely, 


=1 


i ; 2x0 6 (x6 — 2x0 + 1 — e*°) 
=| le te een rapes (26) 
arecnie 4b x(d6—2c) +1 
which, solved for 6, gives 
Ne tO — ote tek 
6S (27) 


~~ 246 — 2x6 + 2—e2e" 


For oc = 0, the right side of (27) reduces to (16), which, of course, 
should be the case. Multiplying 6 by the input intensity, 2a” (a6 + De, 
we finally obtain the approximate output intensity 

- 2% (xd — 2%0 + 1) 


j= See eee 
*  (@8 + 1) (206 — Qae + 2— ee)” (28) 


If the accuracy of the approximation (28) is comparable with 
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that of (17), it can be used to derive the critical value of « for which 
a maximum exists for x3. 

We note that for so = 0, certainly no maximum exists, since there 
is no maximum for positive x of the expression (17). On the other 
hand if « = 6/2 (the maximum value for which our computations are 
valid), a maximum of (28) must exist, since in that case Lim #, = 0, 


while dz;/dx > 0forx=0. 

To obtain o*, the critical value of «, we should demand that 
di;/dx = 0 have a positive real root. This involves the solution of 
a transcendental equation, which, however, can be easily obtained 
by graphical methods. 

An immediate problem which suggests itself in connection with 
the various models proposed for inhibition is to compare the critical 
values of o which insure the existence of a maximum in the input- 
output curve for each model, and the determination of the various 
limiting values of x; as x approaches infinity. The study of actual 
input-output curves should then indicate whether any of the pro- 
posed models can be taken as a hypothetical mechanism for excita- 
tion-inhibition phenomena. 

This investigation is part of the work done under Contract No. 
AF 19(122)-161 between the U. S. Air Force Cambridge Research 
Laboratories and The University of Chicago. 
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In line with a suggestion made in the recent book by the author, a 
distribution function is derived for the distribution in a social group of 
such conservative quantities for which every individual has a minimum 
eels he this minimum requirement varying from individual to in- 

ividual. 


In chapter ix of our recent book (Rashevsky, 1950) it was point- 
ed out that if a conservative quantity, that is, a quantity whose total 
amount remains constant, is distributed among the individuals of 
a population at random, the distribution curve is a simple exponen- 
tial. The Number N(x)dz of individuals who possess an amount of 
the quantity in question between x and x + dx is of the form 


N(a)dxa=Ae“dz. (1) 


This is formally identical with the Boltzmann energy distribution. 
However, as remarked in loc. cit., if « denotes incomes or wealth, then 
the expression (1) has meaning only for x > €, where & is the mini- 
mum amount of subsistence, since for « < & an individual cannot 
exist. The distribution function of such quantities should therefore 
be represented by the curve shown in Figure 1 of chapter ix of 
loc. cit. That is, the curve is zero for x < &, and is given by 
A exp [—a(a — &)] for ~ > &. It thus exhibits a sharp angle at 
xa=—é. 

Actual distribution curves of incomes, while roughly similar to 
that, do not exhibit the sharp angle. It was suggested in loc. cit. that 
this may be due to the circumstances that € varies from individual to 
individual, and that a smooth curve is obtained by a superposition of 
an infinite number of “angular” curves with different values of &. 

In spite of the serious conceptual difficulties of this explanation, 
which were discussed in loc. cit., we propose in this note to derive 
an expression for a smooth curve generated in the above-mentioned 


way. 
309 
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Let U(é)dé be the number of individuals whose & lies between 
& and € + dé. There is actually no human being for whom = 0, so 
that the function N(é) has a meaning only for > A, where A is 
the smailest possible value of €. It is also at least plausible that there 
is an upper limit 4, of €, although the assumption of such an upper 
limit is not essential. The quantity x is assumed to be distributed ac- 
cording to A exp[—a(x — &)] for each group of individuals charac- 
terized by a given €, and in each group x > &. Hence the resulting 
distribution function F'(«) is given by 


F(z) =A [re ett) df= A ee is U(éye* dé, (2) 
JA 
If an upper band 4, for & exists, then for x > A, 
F(x) =e [Cue ett dé. (3) 
A 
Thus for « > A, F(x) in this case is a simple exponential of the 


same form as (1). 
We may choose for U(a) an inverted parabola 


u() =B[ @— 4) - S| (4) 


In this case U(w) is zero for 6 = A and = A,. If Np is the total 
population, then 


»Ay 
A 


and therefore 
6N, 


B= 
(4,— A)? 


This choice of U(&) leads to rather cumbersome expressions. A sim- 
pler one is obtained by putting 


U(E) = N, 8? (6-4) eh em, (7) 


In this case there is no upper limit As. 
Introducing (7) into (2) we find, putting AN, = C, 


F(«) = co] gs gales) =| = cs oe ) ete-s) | * (8) 


For b >> a this reduces to 


(6) 
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F(x) =C {e*@4) — [1 +b0(2— A)] e?@}, (9) 


The function F(x) is plotted for a = 44 X 10° dollar“, 
b = 6 X 10° dollar--, C = 10° individuals in Figure 1 below. For 
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comparison, the circles show data from actual observations on the 
distribution of income, taken from page 253 of the Statistical Ab- 
stracts for the United States, 1946. In view of the objection raised in 
loc. cit. the agreement is not to be considered as too significant. It 
emphasizes, however, the need for a study suggested at the end of 
chapter ix of loc. cit. 
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Leadership, whether executive, political or any other type, is con- 
nected with the achievement of some goal by the social group through an 
appropriate organization of that group. From this point of view differ- 
ent leadership ranks in a group would be assigned to individuals ac- 
cording to their ability to organize the group for the purpose of reach- 
ing a specific goal. The situation is actually complicated by the circum- 
stance that an individual may have the necessary ability but may not 
like the responsibility connected with the leadership, or vice versa. Also, 
he may not be interested in the goal. The suggested mathematical ap- 
proach is to consider that the satisfaction of an individual is a function 
of his leadership rank, of the goal, and of several other parameters. If 
each individual tends to adjust his position in society so as to maximize 
his own satisfaction, this condition gives us the equations which deter- 
mine the leadership rank of each individual. It is found that, in gen- 
eral, the rank of an individual depends not only on his ability, but on 
the abilities of all other individuals. The method enables us to calculate 
the distribution function of abilities among individuals of a given rank, 
and leads to results which allow, in principle, experimental verification. 


In our previous studies of mathematical biology of social be- 
havior (Rashevsky, 1948; 1950) we discussed possible mechanisms 
of change of behavior of a social group as a whole. For the particular 
case of only two possible mutually exclusive behaviors, A and B, we 
considered that the tendency, ¢, toward one or another of those behav- 
iors was distributed continuously within the population. Individuals 
with ¢ = 0 have no preference toward either A or B. Individuals with 
¢ > 0 prefer A, those with ¢ < 0 prefer B. Individuals with high 
absolute values of 4 act for the most part according to their innate 
tendency ¢, and their behavior is very little affected by imitation. 
Individuals with small absolute values of ¢ are, on the contrary, act- 
ing primarily by imitation. It was shown (Rashevsky, 1950; Lan- 
dau, 1950; Landahl, 1950) that owing to imitation the majority of a 
social group will, in general, choose either behavior A or behavior B, 
even though the distribution of ¢ is symmetric with respect to ¢=0. 
Which of the two behaviors is adopted is a matter of chance. 

Changes in the behavior of a social group occur when the dis- 
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tribution function of ¢ changes so that the average value ¢ of ¢ 
shifts sufficiently in one direction or another. In this picture there 
is no manner in which the single individual can influence the behav- 
ior of the group. This point of view leads to historical determinism 
which is explicit in our previous studies. Yet, within limits, individ- 
uals do affect the course of some events and do control the behavior 
of more or less large social groups. How do these cases fit into the 
above-mentioned theory of social behavior? 

If a social group adopts a particular behavior; for example, A , 
we may perhaps consider that the few individuals with the highest 
positive value of # will control the situation and will be the leaders. 
Such a picture, however, is too oversimplified, and does not take into 
account the following situation: There is a difference between the 
adoption of a behavior pattern and the ability to have this pattern 
properly organized. For instance, society may adopt a behavior pat- 
tern consisting of belligerent activities. It will extol militarism and 
begin aggressive wars. But it may not possess any military leaders 
of sufficient talent to insure victory in such wars. Or the accepted 
behavior pattern may be that of peaceful production of some goods. 
But in the absence of individuals who can properly organize such a 
production, no beneficial results are obtained. 

Individual leadership pertains not to the general behavior pat- 
tern, but to its internal organization with a definite purpose in mind. 
In a sense, any theory of individual leadership should be developed 
on the basis of a general theory of biological and social organization 
(Rashevsky, 1946). Since at least some of the activities of mankind 
are purposeful, therefore mankind has organized certain hierarchies 
of leadership to expedite those purposes. These hierarchies are dif- 
ferent in nature from the more general hierarchies studied previously 
(Rashevsky, 1950) which are established due to biological mech- 
anisms in which purpose does not need to play any part. 

Viewed from this angle, it would seem at first that the leader- 
ship of an individual will be determined by his ability to organize 
and direct certain tasks, and that in this hierarchy of leadership the 
top leader will be the one with greatest abilities, the next one with 
next best abilities, and so forth. That such is not actually the case 
is shown by everyday life, especially in the field of political leader- 


ship. Mediocrities occur also in leading positions in industries and 
other private organizations. 


If we look for another plausible factor which may determine the 
leadership rank of an individual, we find that such a factor may be 
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the desire of the individual for leadership. A person may have all 
the abilities necessary to organize a certain group task. But the re- 
sponsibilities and troubles connected with such organizational work 
may be very unpleasant to him, and he will not want a position of 
leadership. On the other hand, an individual of mediocre abilities 
may crave a leading position to such an extent that he may obtain 
it by sheer vociferation. This sounds ludicrous, but it actually occurs. 

Another factor which must be considered is the desire of indi- 
viduals for a particular goal which can be achieved by organization. 
An individual with high organizational abilities may be quite indif- 
ferent toward that goal. Another individual with lesser abilities may 
have a very strong desire for that goal. If most of the group desires 
that goal too, the less capable enthusiastic individual may become a 
leader, rather than the more capable indifferent one. 

In dealing with problems of motivation, is it useful to introduce 
the concept of satisfaction function (Rashevsky, 1948; Rapoport, 
1947a,b). Therefore we shall introduce it here. 

The satisfaction function S; of the ith individual in a group is, 
in the present case, a function of his leadership rank r; and of the 
measure x of the achievement of the desired goal. The rank 7; may 
be an integer, assigned to each individual, thus: first in rank, second 
in rank, etc. Or it may be a continuous quantity, measured on an 
appropriate psychophysical scale (Rashevsky, 1950). The quantity 
# may represent anything, for example, the total industrial produc- 
tion by the group, the number of schools, or the quality of schools 
measured on an appropriate scale. The quantity x is the same for 
all individuals of the group, but the desire for it varies from person 
to person. 

The satisfaction S; may, in general, contain a number of para- 
meters J‘, where the subscript refers to the individual, the super- 
script to the nature of the parameter. Thus 4, 42", As) +++» are 
values of the same parameter for different individuals, while 4%, 
7, +--+ are values of different parameters for the same individual. 
The satisfaction S; of the ith individual may, in general, depend not 
only on the 2;’s, but also on the values 7; of the parameters for 
other individuals. Denoting by N, the total number of individuals, 
and by m the number of different parameters 2, we thus have 


S;=Si(ri, ej de); h=1,2----NoypHl1,2----m. (1) 


Each individual may act either egoistically or altruistically, as 
well as intermediately (Rashevsky, 1950). In the first case each in- 
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dividual tries to maximize his own satisfaction by trying to choose 
the appropriate 7;. We thus have 


OSi , 

Or; ; (2) 
together with the other standard conditions for a maximum. 

Different individuals have different ranks 7; of responsibility 

and importance in an organization. The quantity x depends on the 
abilities of all the individuals concerned, as well as on the corre- 
sponding ranks 7;. If two individuals of different abilities inter- 
change their leadership ranks, x will change. The quantity x may, 
in general, also be a function of the 4,’s. Thus we have, with 
B=1,2----Nop pH=1,2----m: 


x=f(a, Aay**, Gy 3 Mi, Tey °0*, Yn, 3 An) « (3) 


Introducing (3) into (1), and then the latter into (2) we find Np 
equations for the determination of the N, values 7r;. From them 
we find: 
tee U4 (Cage Aw dee k=1,2----No; 
4 
p=1,2----m. ( ) 

If 7; is an integer by definition, then the integers nearest to the 
values given by equation (4) are to be taken. 

Thus we see that the leadership rank of an individual depends 
not only on his own abilities, but on the abilities of all other individ- 
uals. 

If each individual acts altruistically (Rashevsky, 1950), he 
will try to maximize S = >> S:. We then have instead of equations 


v 


(2), the following one: 
hy 
— = 0) 
Or; 


(5) 


Introducing (3) into (5), we again find N equations for the de- 
termination of the ‘N ranks 7;. Similarly, the most general case 
(Rashevsky, 1950) is treated in which the ith individual maximizes 
the linear form 


Kis Si + Ieig Sp + +++ + Keay, Sy, (6) 


If Si depends on 7; only, then 0S/er; = 8; /er; and egoistic and 
altruistic behaviors lead to the same result. It is, however, quite 
natural to assume that S; depends not only on 7; but also on the 
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other 7;’s. A person may be pleased or displeased to see a particu- 
lar person occupy a definite rank. In such cases egoistic and altruis- 
tic behaviors lead, in general, to different results. The comparative 
study of the different behaviors offers an interesting problem. 

We shall illustrate the above general method on two special 
cases. First let us put 


S; = 913 log (7; =F x) ein ’ (7) 


where a; and f; are parameters. 

Without loss of mathematical generality we may put /; = 1, 
though psychologically it makes a difference whether an individual 
likes the quantity 7 more, or dislikes the effort to get that quantity 
r less. 

In choosing the function f we must consider two possibilities. 
The rank may be measured in relative units. The individual who 
has the highest rank is relatively the most important and respon- 
sible one. In this case, f must be a homogeneous function of zeroth 
degree in the 7;’s, because increasing all the 7’s times does not 
change anything, and leaves x, and hence f, unchanged. On the 
other hand, we may consider absolute rank. This may, if convenient, 
be normalized so as to vary within the range 0, 1. An individual 
with rank 0.9 is not only relatively more responsible than an indi- 
vidual with rank 0.7, but actually has greater power. In a group in 
which the highest possible rank is 0.7, the highest ranking indi- 
vidual has less executive powers than an individual with rank 0.8 in 
another group, though in that other group the highest rank may be 
even higher, say, 0.95. 

We shall consider here only the case of absolute rank. In that 
case we may choose the function f in equation (3) so that 

4D UTi- (8) 

This is a plausible enough assumption. It means that the higher the 

ability of an individual with a given rank, the better the result. More 
generally we may put: 

C= DA, 04 Ts (9) 


or 
=> (a; + a3) 17%. (10) 


Let us choose expression (9) as an example. Introducing it into ex- 
pression (7), and then the latter into (2) we find 
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a(l+ aia) 9. (11) 
T, + > Uy Ox 1% 
k 


This gives 
7, =a; (1 + a; 01) —D Oe Oo Tee (12) 
k 


The expression © a a 7, on the right side which is nothing other 


than x, does not Hepend oni. It is the same for all individuals, and 
may be treated as a constant, independent of 7. Hence equation (12) 
may be written 


7, = 0;,(1 + a; a;) et Pini (18) 
Introducing (13) into (9) we find: 
xs=> (hn ax Lo, (1 + Ak ax) —z], (14) 
k 


or 


D> Ox an? (1 + ax ak) 
k 


= (15) 
1, Se a Ak Ak 
k 


Hence, introducing (15) into (18), we find: 


> x ou? (1 + Ax ax) 


vr, = a;(1 + a; a;) — (16) 
ilse >, Ak Ax 
kK 


For very large values of N, we may substitute integrals for the 
sums. The rank r then becomes a function of the continuously dis- 
tributed a and a, and is itself a continuous quantity. Let Ni(a) and 
Nz2(a) be the respective distribution functions of a and a in the so- 


cial group, and let a also be normalized so as to vary from 0 to 1. 
Then: 


N, fac 4 nade Ni (aN elas 
r(a,a) =a(1 + aa) — : : 
LEN, f aa N,(a) N2(a) dada 


- (AG) 


If both N,(a) and N.(a) are normal distributions with aifiel: 
ently small standard durations o, and o., and with medians a* and a* 
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hear a = 3 and a= 4, then the integrals in (17) may be evaluated 
with good approximation by extending the range of integration from 
— «© to + o. Then 7 becomes a function of a, a, of the standard 
deviations o, and o,, and of the medians a* and a’. 

As another example we choose: 


S,=a;logr.—r, + Blogz, (18) 
with either 
w= Oh Ox Bu Tk, (19) 
or 
se == (1 dy + EPx) 1x. (20) 


Using expression (19) we find now from equations (2): 


Sie aeigie at Biee = (21) 
V5 D U% ox Be Tx : 
k 


or, remembering (19), 


ae (22) 
=, 
B= Oh; Ox Bi? 
Introducing (22) into (19) we find the following equation for the 
determination of x: 


Oy Ox” Br 


wc Zo 
k LAMINAE TT ( ) 


If N, is very large, and if N;() is the distribution function of f, 
while N,(a) and N2(a) have the same meaning as before, then we 
may write instead of (23): 


“aa? f N,(a) Ne(a) No(B) da dad 
p || Oe ee is 


Pie. 32 


In general this is a transcendental equation in 2, and cannot be 


solved in closed form. 
Consider now expression (17). It is, as we have seen, of the 


form: 
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r (a,0) =F(a,0;01,02,0, a’), (25) 


where the parameters o,, o2, a* and a* are characteristic constants 
for a given social group. Consider a specified constant rank 7. The 
equation 


r=—F(a@,0;01,02,0",a°) (26) 


determines a curve in the a, a plane. a 
Omitting, for brevity, the constant parameters, and retaining 
only r, we may write the equation of the curve as follows: 


a=V(a,r). (27) 
Put 
Se  ) oe 
0a or 


If instead of 7 we consider a value r + dr, we find instead of 
the curve (27), an infinitesimally near curve, whose distance from 
the curve (27) in the direction of the normal is 

V,(a,7) 
dy = —————_——-_ dr. (29) 
V Lerche (Gore 


The length of the element ds of the curve is given by 


ds= 1+ [V.(a,7r)]2da, (30) 


so that an element ds dy of area between the two infinitesimally near 
curves is equal to: 


ds dv=V,(a,7r) dadr. (31) 


We may now ask for the relative probability of a given value of 
a in the strip between the two curves. This will give us the distribu- 
tion of a for individuals whose rank is between r and r + dr. This 
ra in general, different from the distribution of a in the whole popu- 
ation. 


The relative number of individuals with a and a lying within 
the area da da around the point a,a is 
N,(a) N2(a) dada. (32) 


If we confine the area ds dy to the infinitesimal strip discussed above, 
ais fixed by (27) for a given a, and we must substitute V(a, 7) for 
a, while for da dx we substitute ds dy as given by (31). We thus 
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obtain for the distribution function of a for a given interval 
(eur -- dr) 


N,(a) =N, (a) N2[V(a,7r)] V,(a,7r)dadr. (33) 


This is the relative frequency of individuals with an a between a and 
a + da, if their rank is between r and r + dr. 

If instead of expression (17) we have the expression (22) in 
which there are two parameters, a and 6, we can use the same pro- 
cedure, referring now not to a curve, but to a surface. 

If we assume a given expression for S, say, either (7) or (18), 
or else some still different one, we can in principle determine the dis- 
tribution function N,(a), N2(a), etc. for the whole population. Then 
for every assumed form of S we obtain a definite expression for 
N,(a), which again in principle can be checked. Thus the assump- 
tion of a particular form of S may be verified. 

If the rank 7 is very high, then there may be only one, or very 
few, persons in that rank. Expression (33) then gives us the rela- 
tive incidence in time of a given value of ability a in leaders of such 
a high rank. 
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AN APPLICATION OF THE THEORY OF NEURAL NETS TO 
THE STUDY OF MOTION SICKNESS 


H. D. LANDAHL 


COMMITTEE ON MATHEMATICAL BIOLOGY 
THE UNIVERSITY OF CHICAGO 


The results from a study by M. F. Morales on the labyrinthine 
response to motion are used, together with results of neural net theory, 
to enable one to calculate, in terms of a small number of parameters, 
the percentage of individuals who have motion sickness when exposed 
to rather general patterns of motion which may vary with time. 


In a paper by M. F. Morales (1946), a theory has been dis- 
cussed which makes it possible to estimate with some success the ef- 
fectiveness of the spatial and temporal pattern of a given periodic 
motion in producing motion sickness. The present note is an analytic 
treatment of the dose-response problem for this phenomenon. 

Let the average frequency of the action spikes produced by the 
labyrinthine receptors during the cycle of the motion be denoted by 
y — y,, where is the value in the absence of motion. The value of 
y (« of Morales, 1949) shall be considered to be given in some arbi- 
trary units by means of a mechanism such as that considered by M. 
F. Morales (1946) from a knowledge of the form of one complete 
cycle of the imposed wave. For any given pattern, » would be pro- 
portional to the greatest value of the acceleration in any cycle. If e 
and j are measures of excitatory and inhibitory factors at some cen- 
ter which produces a particular response under observation, then we 
may expect the following relations to hold, if the simplest possible 
connections from the labyrinthine receptors are postulated (Rashev-. 


sky, 1948): 


—=— Ay®"— ae, (1) 
dt 
. 
os spy bf: (2) 
dt 


A,a,B,b, and n are constants, 2 being close to one. The 
353 


where 
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term »” is introduced to allow for a possible non-linearity between 
the quantity E’ (Rashevsky, 1948, p. 357) and ». 

We next introduce the rather reasonable postulate that, for the 
effect to be produced, « — 7 must exceed the threshold value h which 
is distributed over the population according to the distribution func- 


tion p(o log h/h) where h is the median value of h, and where 


1 
p(é) =—— ee? , (3) 
V 2n 
We define 
P(a) = f pads. (4) 


Thus for any value a, P(a) may be found from a table of the area 
under the normal error curve, e.g., if the values in the table range 
from zero to 4 as the deviate moves from zero to infinity, P(a) is 4 
plus or minus the tabulated value, depending upon whether or not 
the value of « is positive or negative. 

We shall now consider a few special cases. 

1) Let the accommodative effect be negligible, Ley B <<7AR 
Then we have the following result for the case of a constantly main- 
tained motion producing a constant average ». The percentage in 
the population showing the effect under observation, e.g., nausea, 
will be given by P,(y,t): 


1 a 
P.(v,t)=P| “logo (1— ey /m | (5) 
Co 
(vy=constant ,A >> B), 


where »* = ah/A, and y is thus the value of » which produces the 
effect in one-half of the population when the motion continues for a 
long time. 
2) For the Same conditions imposed upon the parameters as 
in the preceding case, consider a constant motion maintained for a 
time t,. We then ask: How long will the effect last after t,? After 
a time t, we have « — j given by 


e(t,) — f(t) =<» (Pee (6) 


so that after t, we have, if ¢ is measured from t,, 
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(7) 


shill 
Seale (lie 628 ) tea. 


Now if « — 7 in equation (7) is set equal to h, then ¢ will be equal 
to some value z, the median duration of the effect. Solving for 7 we 


(8) 


obtain 
Sy Sal = 
T = Oe [Ay (t— Ee") /ah]. 


For sufficiently short exposures the median duration of the response 


(9) 


is given by 


tT 


1 se 
=-log Ar" t,/h. 
a 


It may be that there is a minimum time 7, in which the effect can 
occur. Such a quantity may have to be introduced, especially when 


7 is small. 
3) If the accommodation cannot be neglected, then we obtain, 


for the case of a constantly maintained motion, 


P, (y, t) = 
P ae Ay" (1 — e**) /a— By (1 — e*") /b (10) 
| o “rn (B/A)Y@)) /a — By™{1 — (B/A)/@}/b 5 


Equation (5) or equation (10) then determines the percentage re- 
sponse as a function of the intensity and duration of the exposure 


100 V=3V, B=0 
) v2, = 2 Mp2= Sb 
ie nee es ea V=2V, B=0 
/ x e 
80. 1 ye : 
! Sea 
! any 
' XN 
2 60 : - xs 
on ‘ S a 
= ! : Ve2v, he %, a= 5b v=V, “h=2%,,a=5b 
z ! px. NM agp atetecettteerorest ere cres ses ree ccecce es = 
o) a7 aa ii aa eerie v=V, B=0 
Aol eat : » A 
rm) ei) ‘ 
ao ey) i \ F s 
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1] ‘i “D4 ae 
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356 STUDY OF MOTION SICKNESS 


to the motion. The type of relation expected is illustrated in Figure 1 
for several values of the parameters. 


Evaluation of Parameters. First let the exposure times be suf- 
ficiently short so that bt << 1. Then for any time of exposure, the 
per cent of the population showing the effect should be approximately 
represented by expression (5), and thus a plot of the per cent against 
intensity of the motion (v7) on log-probability paper should result 
in roughly straight lines of slope s = o/n, e.g., 


o/n=4 log [vy (84%) /y (16%) ] - 


For very short times of exposure and large intensities, one should 
find »"t = constant. Thus if P is plotted against t under these re- 
strictions, approximately straight lines should be obtained, each with 
slope o, e.g., o = 4 log [¢(84%) /t(16%)]. Thus the ratio of « tos 
isn. Now vis obtained, by interpolation if necessary, as that value 
of » giving 50% response atthe time when the response is maximum. 
If the accommodation ratio (B/b)/(A/a) is a small fraction, or if 
b <<a, P,(v) =4 fort >> 1/a, butt <1/b. 
If we set P, = 4 then 


— y/(1 —et)jin (P= 


bol 


). (11) 


Since » and n are estimated, this plot determines a approximately 
[cf. also equation (9)]. Estimation of (B/b)/(A/a) and b is more 
difficult. Some idea of the value of these parameters might be ob- 
tained from recovery data. A more direct method would be the appli- 
cation of a test increase in intensity after various durations of known 
constant intensities (Landahl and Kahn, 1948). Another possible 
method would be that of application of periodic intensities, e.g., equal 
periods of motion and rest. 

We will briefly mention some possible complications. It may be 
that a Single time constant in the production of the observed effect is 
inadequate. This would mean that an €, and e, with corresponding A, , 
a,, Az, d must be introduced. This only introduces an additive term 
in the numerator and denominator of the argument of the function P. 
For example, if the accommodative effect were negligible, the expres- 
sion for the percentage would be obtained by replacing B/b by 
—A'/a' and b by a’ in the argument of the function P. A more difficult 
problem arises, however, if we attempt to consider what occurs if 
parameters other than n are variable over the population. For ex- 
ample, some individuals might show considerable accommodation 
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while others show none. In this case we could perhaps neglect the 
variability of h and consider the relative accommodation to he dis- 
tributed approximately normally over the population. 

If the parameters have been estimated and the various predic- 
tions are approximately fulfilled, it would be possible to estimate the 
percentage of a population showing an effect, e.g., nausea, after the 
individuals’ exposure to a known variable intensity »(t). This is 
given by the expression 


P3{y(t) }= 
= <t 2 
[ew | y"(t)e" dt — Be* { y(t) at| | 
1 is) 
P| log 


2 as ° 
o  Av"{1— (B/A)¥}/a— Byrn — B/ayyery7e | 


If the expression is to give results after a possible decrease in P, i.e., 
the duration of the effect, then one may have to allow for the time 7 
mentioned above, if it is appreciable. In principle, then, for any form 
of motion which may vary in time, the percentage at any given time 
showing the effect under observation can be calculated. To what ex- 
tent these results can be made to agree with observation remains to 
be determined. 

The author wishes to express his appreciation to Dr. M. F. 
Morales for reading and discussing the manuscript. 

This investigation is part of the work done under Contract No. 
AF 19(122)-161 between the U. S. Air Force Cambridge Research 
Laboratories and The University of Chicago. 
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_ In the first part of the paper a general discussion of the trans- 
mission of information through neural chains is given in terms of the 
Shannon-Weaver theory. It is pointed out that with the all-or-none 
law a single chain of neurons connected in series transmits one bit of 
information per signal. A set of N independent parallel chains trans- 
mits N bits per signal. If, however, the chains are interconnected, the 
amount of information is reduced. At the same time, however, the de- 
gree of coordination of the final neuromuscular reaction is increased. 
A relation between the maximum possible speed of a reaction and its 
degree of coordination is derived, and possible applications to spoken 
language are suggested. A general quantitative discussion of the rela- 
tion between amount of information and amount of knowledge which 
an individual may obtain when confronted with the external world is 
made and a possible connection with new trends in logical thinking is 
pointed out. 

In the second part transmission of information through “social 
chains” is discussed under certain special assumptions. An expression 
for the “social channel noise” in terms of the length of the channel is 
derived. Finally an expression is given for the amount of information 
transmitted from one individual to another in a social group of uniform 
density as a function of the physical distance between the two indi- 
viduals. 


The possible importance of the mathematical theory of communi- 
cation for biology and sociology has been discussed in recent publi- 
cations (Wiener, 1948; Shannon and Weaver, 1949). The purpose of 
this paper is to discuss some possible applications more specifically 
than has been done hitherto. In our discussion we shall follow the 
presentation and notations of C. Shannon and W. Weaver (1949). 
We shall first mention in a most general way some applications to 
the biology of the individual, then we shall study more specifically 
a problem of communication in social groups. 

Eins, 


OX 


J. The Individual. 


The overt, observable behavior of an individual is manifested 
in the vast majority of cases by a set of more or less complex motor 
reactions, which may be, and are frequently, used as channels of 


359: 


360 MATHEMATICAL THEORY OF COMMUNICATION 


communication. Perhaps the most important of those motor reactions 
is speech. The relatively more rare non-motor reactions are glandu- 
lar secretions, such as sweating, salivating, and weeping. Of those 
only the latter is used as a channel of communication, and a rather 
important one at that. Confining ourselves to motor reactions does 
not entail any loss of generality. 

Individual elementary motor reactions may be considered as sig- 
nals or symbols, in the same sense as used by C. Shannon. They are 
caused either by external stimuli, or by internal stimuli or drives, 
which originate endogenically in the brain. In either case there is 
a transmission from the site of the stimulus to the motor ending. Dif- 
ferent regions of the brain thus act as communication channels. 

Assuming the general validity of the all-or-none law, we see that 
a neuron can either only fire or not fire. Thus it is capable of trans- 
mitting exactly one bit of information per signal (Shannon and 
Weaver, 1949). A simple chain of neurons in which all thresholds are 
sufficiently low, so that the firing of a preceding neuron always re- 
sults in a firing of the subsequent one, also transmits exactly one bit 
per signal. If » is the frequency of firing of each neuron in the chain, 
the rate of transmission C of information by such a simple chain is » 
bits per second. The frequency » is limited by the duration of the 
refractory phase of the neuron, and its highest possible value is of 
the order of magnitude of 10? sec?. Hence the maximum channel 
capacity C of a simple chain of neurons is about 102 bits sec*. If 
we have N independent parallel chains, then the channel capacity 
of such a system is of the order 102N bits sec*. 

If the individual parallel chains are, however, interconnected by 
either excitatory or inhibitory links, the capacity of the channel is 
decreased. We shall illustrate this on the following examples. 

Consider first two parallel chains. The amount of information 
transmitted per signal is, as we have seen, exactly two bits. Let the 
two chains, however, crossinhibit each other, so as to form the well- 
known circuit, studied by H. D. Landahl (1987) (cf. also Rashev- 
sky, 1948). Let us first consider the case in which the random fluc- 
tuations of excitation and/or thresholds are absent. Then, with a 
proper choice of the parameters involved, the circuit can produce 
either one of the two reactions R, or R,, or none, depending on the 
intensities S, and S, of the stimuli. We shall denote the latter case 
symbolically by R,- R.. This circuit cannot produce both reactions. 
Now we have three possible signals: R, ; when S;.— S20. Ae Ree 


when S, — S, > h; and R,- R,, when |S, — S.| < h. Hence the in- 
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formation per signal is now log, 3 = 1-58 bits. Now consider the 
general case in which fluctuations are present. We still have the 
same 3 possible signals, but their choices are not quite free. Even 
if we make S, — S, > h, there will be a probability P,, of reaction 
RF, and a probability P. of no reaction at all. Thus even if the choice 
between S, — 8S, > h, S, — Si > h, and |S, — S.| < h is equally 
probable, and equal to 1/3 for each choice, the signals R,, R., and 
R, - Rs will not be equally probable. Hence the information per sig- 
nal will be less than log, 3 (Shannon and Weaver, 1949). The ac- 
tual amount of that information may be computed in the following 
Way: 
We shall first restrict further the assumption that the choice 
of either of the three inequalities 


ae De Oa ee [Si —S:| <h (1) 


is equally probable, by considering the case in which there is an 
equiprobable choice between 3 pairs S,, S. of fixed values of S,; and 
S,, the three pairs satisfying respectively the three inequalities (1). 
Denote by P,, P, and P; the probabilities of reaction R,, R, and 
R, - R, respectively, for the case S, — S. > h. Those probabilities 
are given by expressions developed by H. D. Landahl (1937), and 
are functions of S, — S.. Similarly, denote by P’,, P’. and P’; the 
respective probabilities of R,, R. and R,- R,, when S, — S, >h, 
and denote by P”,, P”, and P"; the same probabilities for |S, — S,| 
< h. Now if we denote by ,, »2. and p; the probabilities respec- 
tively of R,, R., and R, - R., when S, — S, has any of the three 
possible values, and if we remember that inequalities S, — S. >h, 
S, — S, >h, and |S; — S| < h are equally probable, then we find 


=f ( Pee Pa P's); 
Pea Pe Pe Ps) 3 (2) 
pe —=+ (P2+P3+P"s). 


The information per signal, or entropy H of the system, is 
H =— (p, log p; + pe log pz + Ds log ps), (3) 


and is a function of the chosen three pairs (Si, Sz). 

This result may be generalized by considering the probability 
Q(A)dA that the difference A = S; — S, will lie in the interval be- 
tween A and A + dA. It is possible to find such a function Q(4) 
that the probabilities of inequalities (1) will be all equal. This re- 
quires that 
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“@(a) d4= f- Q(A) dA, (4) 


-h 


few dA= 


and there is an infinite number of functions Q(A) which satisfies 
that requirement. We now denote by Pi(4), P,(A), and P,(A) the 


probabilities of R,, R., and R, - Ro respectively, when A has a given 
value. Instead of (2) we now have 


p= { Py(4) Q(A) da; 
p= [ P.(A) Q(A) da; (5) 


Ds = IE P,(A) Q¢A) dA. 


The entropy is again given by expression (3) and depends on the 
distribution function Q(A). 

A cross-excitation between parallel neuron chains also acts to 
decrease the information per signal, by reducing the independence 
of choice of the individual signals. 

Voluntary motor reactions are mediated mostly through the py- 
ramidal tract, and partly through extrapyramidal fibers. The total 
number of fibers in the pyramidal tract is estimated at about 10°. 
If they are all independent, this would give a possible maximum 
channel capacity for the pyramidal tract of about 10’ bits sec7. 
Whether a reaction is caused by an external or internal stimulus, in 
either case the pyramidal cells constitute one of the last steps in 
transmission. The transmission from stimulus, whether external or 
internal, is done through some as yet unknown region of the brain. 
Since neither the number of cells involved, nor their connections is 
known, the channel capacity cannot even be guessed. We shall de- 
note that channel capacity by C*. 

Now consider any set of motor reactions. Each reaction may 
be considered as a signal. If they are independent, and if there are 
nm such reactions, the information per signal is equal to log. ” bits. 
In general, the different signals are not independent, and this re- 
duces the information H per signal. Thus, for example, a flexor and 
a corresponding extensor cannot contract simultaneously. If in a 
reaction pattern all reactions are completely independent, we may 
characterize such a pattern as physiologically completely uncoordi- 
nated. No prediction can be made about a subsequent signal from 
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the knowledge of preceding signals. If a certain degree of coordina- 
tion is introduced, then certain more or less fixed patterns appear. 
By observing the beginning of a coordinated movement it is possible 
to predict with considerable probability its future course, at least 
up to a point. Physiologically the coordination is due to neural con- 
nections at higher neural levels. It reduces the number of possible 
reactions. From the point of view of the communication theory co- 
ordination reduces the entropy of the reaction pattern by reducing 
the independence of each reaction. The more coordinated or rigid a re- 
action pattern, the less information it can convey per signal. If H* 
denotes the entropy per signal of a reaction pattern, and H,, the maxi- 
mum entropy per signal which the pattern could have, if completely 
uncoordinated, then we may define with C. Shannon the quantity 


H* 
fT. m 


r=1— (6) 
as the redundancy of the reaction pattern. From what has been said 
above, it is natural to consider the redundancy rv of a motor reaction 
pattern as a measure of the degree of its coordination. It is diffi- 
cult to see at present how the redundancy can be measured for some 
very general types of motor patterns. It can, however, be measured 
for a very important type of patterns, namely, speech. (Shannon 
and Weaver, 1949). This possibility opens the way of verifying some 
interesting conclusions. 

If C* is the capacity of the central channel in bits sec, and H* 
is the entropy of the reaction in bits signal", then the maximum rate 
of transmission Tmax of signals is 

T max — —- Signals sec?. 7). 

WS (7) 

The average time é between the signals would be given in that case 

by H*/C*, if the speed of the reaction were limited only by the speed 

of transmission. Actually there are some purely physical limitations 

to the speed of muscular reactions. Those limitations set a lower 

limit + to the average time between signals. Hence, with a better 
approximation, we have: 

H* 

Ce 

The maximum actual rate of performance of reactions per second 

is, therefore, 


t= 7'+ (8) 
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(9) 


Solving equation (6) for H* and introducing the solution into equa- 
tion (9) gives 
Un = sel aa (10) 
7C* + An(1—7r) 

When applied to speech, expression (10) establishes a relation 
between the measurable quantities Un, Hm, and 7 of a language. 
The greater the redundancy of a language the faster it can be spok- 
en. Expression (10) could be best verified on polylingual individ- 
uals, using the same individual for several languages, and then meas- 
uring the constancy of C* and 7. 


N2———- ———- ————- R\, 
N3 ———- ————— ———— R, 


Seer mrapemers ever sre=serrner Smee 
FIGURE la 
The neurophysiological basis of this phenomenon is clear. If 
the reactions R,, R., R;, R, in Figure 1a are mediated by independ- 
ent chains, then it requires the successive firing of the 4 central neu- 
rons, N,, N., N;, and N, to produce the sequence fy, Ro Ray Bee 
If, on the other hand, due to cross-connections, the firing of a single 
neuron N, results in the sequence R,, R2, R;, R, (Figure 1b), then 


Ze \—. 


an economy of time will be effected. Cross connections of that type 
may be viewed as a mechanism for coding. The firing of N, may be 


FIGURE 1b 
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considered as a code signal for the sequence R,, R., R;, R,. The 
cross connections may be either fixed and innate, or they may de- 
velop as a result of conditioning. If a sequence of initially inde- 
pendent reactions occurs sufficiently frequently, central cross con- 
nections of a conditioned nature (Rashevsky, 1948) will develop, and 
this will result both in an economy of central neurons necessary to 
produce the sequence and in an economy of time. Such an encoding 
has apparently developed in the English, and probably in other, lan- 
guages in the course of time, as evidenced by K. Zipf’s findings 
(1935) that the shorter a syllable the greater the frequency of its 
occurrence. Any process of learning a complex motion pattern re- 
sults in such a conditioning between the different elements of the 
pattern and in a resulting encoding and increase in rate of perform- 
ance. The first attempt to perform a new complex pattern results in 
slow performances. With learning, the speed increases. 

If we concede that thought is merely a covert, “unspoken” 
speech (Jacobson, 1931), then equation (10) may perhaps be ap- 
plied to the speed of abstract thinking. Solution of an abstract non- 
mathematical problem may take less time if the individual thinks 
in a language with greater redundancy. Whether this is so or not 
future experimentation may show. The theory proposed here def- 
initely suggests quantitative experiments along new lines. 

While the more redundant a language, the more rapidly it may 
be spoken and perhaps thought, yet the less information it carries 
per signal. If, as suggested elsewhere (Rashevsky, 1948, chapter 
xlii), the rational solution of a problem requires a transfer of as 
much information as possible from one field to another, then a re- 
dundant language may be a handicap. If the abstract language is 
not sufficient to carry all the desired information, the individual may 
supplement it with other motor reactions, such as are involved in 
visualization (Jacobson, 1930). It has been noticed sometimes that 
English speaking scientists use considerably more visual “models” 
in their thinking than, for example, German scientists. Typical in 
the use of “visual models” are, for example, Lord Kelvin and Max- 
well, as compared, for instance, to Plank or Einstein. If the redun- 
dancy of German is less than that of English, the above situation 
may be understood. Other cultural factors undoubtedly also play an 
important part. It may, however, be possible to establish a positive 
correlation between the redundancy of a language and the tendency 
to think in visual terms. This, of course, will not mean that the 
choice of the language is the cause of the thinking habit or vice versa. ° 
They both may be results of some common factor. 
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Now let an individual face a situation consisting of 1 possible 
mutually exclusive events, concerning the occurrence of which he 
does not know anything, and which therefore are equally probable 
to him. Using the terminology of the communication theory, we may 
say that in this case the amount of information which the individual 
can obtain from the situation is log, n per event, or n log, n for the 
whole situation. Gradually, as a result of observation or thinking 
or both, the individual acquires some knowledge about the events. 
He finds out that the probability of occurrence of some of them is 
9; > 1/n; the probability of occurrence of others is less than 1/n. 
His knowledge of the situation has increased. As a measure K of 
this knowledge we may conveniently take the expression 

K=43 (pi—3,)*. (11) 
V5) 
When all events are equally probable, K = 0, as should be the case. 
When one event, the rth,.is known with certainty, then p, = 1; 
p= 0 (i #r), and K =1. But as K increases, the information 
which the individual can obtain from the situation decreases. It is 
now equal to 


be erp ait logs pi (12) 
per event, or 
I=nH =— n> pi logs pi (13) 


for the whole situation. The more an individual knows about a finite 
situation, the less information he can extract from that situation. 
The statement sounds trivial, but the above equations give it a quan- 
titative formulation. 

Instead of expression (11) we may take as a measure of knowl- 
edge the difference between the maximum entropy, log» n, and the 
actual entropy H as given by (12). Thus we may put 


K= log, n — H = log. n + B p; loge p;. (14) 


Since H decreases when K increases, therefore an individual who 
strives for a large knowledge and yet also enjoys the process of ac- 
quiring information will have to compromise somewhere between 
maximum knowledge and maximum available information. If he 
maximizes K, then he will know with certainty one event, say the 
rth (p, = 1). Maximizing H he choses to know nothing about every 
event. He may choose to maximize the product KH. If K is given 
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by (11) then this amount to maximizing the expression 
L——+ > i—Dd;)* > 0; log p;. (15) 
aj a 


When all pi = 1/n, L = 0; it is also equal to zero for p, = 1; 
pi = 0 (1 # 1), and it is positive everywhere else. Hence there is at 


least one set of values p,*, p*,--+-, p,", for which L has a maximum. 
If we choose as measure of K expression (14), then 
L=KH = (log.n—H)H. (16) 
This has a maximum for 
H=log, Vn, (17) 
or 
—2 pi log pi =log, Vn. (18) 


Equation (18) determines a hypersurface in an (m — 1)-dimensional 
space. All points in that hypersurface have coordinates »; which 
maximize L. 

To allow for individual preferences for either K or H we may 
maximize the quantity 


M = K? H8 (19) 


where a and f are individual parameters, both greater than unity. 
Using expression (11) we may maximize a linear combination of K 
and H . This is impossible if we choose expression (14). 

We may consider the same problem from a different, perhaps 
somewhat more realistic, angle. Observation may show that the n 
events are neither all mutually exclusive, nor completely independ- 
ent. Thus we may observe that events a; and a sometimes occur 
separately, sometimes together. This results in a formation of an 
association between a; and a, in our minds. If a; and a; always occur 
together, then this actually reduces the number of independent events 
from n to n — 1. Moreover, if the probability of a; and a, remains 
1/n, then in the new situation of n — 1 events there are n — 2 events 
a, (r #1, k) with a probability 1/n, and one event a; + a, with the 
probability 2/n. This makes the entropy of the situation per event 
less than log.( — 1). 

Quite generally, the problem may be treated in the following 
way. 

3 Let the probability of a; , whether alone or in combination with 
a, be 1/n. Let the same hold for the probability of ax. Let pix’ 
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denote the probability that a appears when it is known that either 
a; has appeared alone or both have appeared. Let pxi’ denote the 
probability that a; appears when it is known that either a has 
appeared alone or both have appeared. In general we may have 
Dik = Pri. If pi’, respectively p;’, denotes the probability that a; , re- 
spectively a, appears alone when it is known that it has appeared 
either alone or in combination, then 


Di + Pix =1; De + De = 1. (20) 


Since the probability of a; appearing either alone or in combination, 
when nothing is known about it, is 1/n, while the probability of its 
appearing alone, when it is known that it has appeared either alone 
or in combination, is p;’, therefore probability p; that a; will ap- 
pear alone in the whole situation is 


wr Deel Dip 
Di -s goat (21) 
Similarly 
, 1 ae 7 Re 
i Wh se ginal (22) 
nN nN 


Finally the probability pi, that a; and a, appear together when it is 
not known that either one of them has appeared is given by 


ey aay 
Disk n . (23) 
The situation now consists of n + 1 possible events: 
Qy , Way **** On} G; + Oy. (24) 


Of those n + 1 events n — 2 events a,(r # i,k) have the probability 
1/n; the event a; has the probability p; as given by (21); the event 
a; has the probability p, as given by (22), and the event a; + a has 
the probability (p'ix + p’xi) /n. Hence the entropy is equal to 


n—2 heey Midis 
H= 108) nics eee oe eee 
n n 


Bn f 3 wae , Lg , , LZ 25 
ae P ki loz 1 5 Dae Oe, Dik + Dri Ke 
n n ‘ 


2 ee 
Me Fe 22 


When p’i, = p's; = 0, this reduces to 
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n—2 


i {f— 


2 
log. m + a log. n= log. n, (26) 


as should be the case, for we then have n equally probable independ- 
ent events. If p'i, =p; —=1, then 


2 
dl — logsn — “aeerlog, (nw — 1). (27) 


Thus if we know with certainty (p' ix = p's: = 1) that two (or more) 
events are associated with each other, we decrease the entropy of 
the situation. If our knowledge is not absolute (p'ix <1; pki < 1) 
then the entropy is reduced by a lesser amount. We can again at- 
tempt to define a proper measure K of knowledge and to con- 
struct such a function of K and HA that its maximum will determine 
the individual’s choice of an optimum -between greatest knowledge 
and greatest possible information to be obtained. 


It is an interesting and important problem to determine what 
function of K and H an individual actually tries to maximize and 
how K is expressed in terms of the p;’s, because this determines in 
a sense the individual’s “philosophy.” Maximizing K will result in a 
' certain knowledge of one particular constellation: the rth event oc- 
curs always, other events never occur. But this precludes any fur- 
ther discussion of any other constellation in the situation. On the 
other hand, when H is near its maximum, the individual’s knowledge 
of the single constellation is small, but the number of possible con- 
stellations is large. We have here, in a sense, not only a “mathemati- 
cal formulation,” but also an actual refinement of the joke which de- 
fines the specialist as one who knows everything about nothing, and 
the philosopher or diletante as one who knows nothing about every- 
thing. We see that the specialist must be defined as one who knows 
everything about only one thing. Perhaps the values of the p;’s which 
maximize either expression (15) or (16) may be considered as cor- 
responding to the behavior of the “ideal’’ scientist, who knows 
enough about a sufficiently large number of things, but who is never 
absolutely certain of anything, and therefore is willing to question 
everything he knows, and thus avoid dogmatism. The contemporary 
shift from the old single-valued Aristotelian orientation to the mul- 
tivalued probabilistic non-Aristotelian orientation illustrates the 
point. In a sense, the non-Aristotelian orientation may be justified 
by the further development of the theory outlined here. 


The preference for either a larger K or a larger H may be de- 
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termined not solely by personal predilections, but may be to some 
extent imposed upon us by external conditions. A practical physician 
knows that drug X is helpful for a particular dangerous disease. 
Though the beneficial results are not always obtained, they are ob- 
served in such a large percentage of cases that expediency definitely 
dictates the use of X for that disease. The physician will instruct a 
younger doctor in no uncertain terms—“If the disease occurs, use Di 
But, with this practical attitude, there is hardly any possibility left 
for discovering that another drug, Y, may be better in some cases 
than X. On the other hand, the scientist or clinician, while know- 
ing that X is practically the best remedy for the disease, will keep 
in mind that this knowledge is not certain. He will try other drugs 
on his patients, and is very likely to lose more patients in this process 
than the practical physician will with the use of X only. But in the 
long run the clinician has a chance to discover the more efficient drug 
Y. In other words, he can acquire more information by his method 
of approach which involves uncertainty. 

An intellectual individual learns as long as he lives. It is natu- 
ral to assume that the rate of increase of his knowledge K is the 
greater, the greater the available information. If we consider a lin- 
ear relation, we may write, with « as a constant, 


dK 
ar H. (28) 
If for K we use expression (14), equation (28) becomes 
dK 
ae x 0ee mK). (29) 


Integrated with the initial condition that for t = 0 (at birth), 
K = 0, equation (29) gives 


K= (1—e**) log. n (30) 
and 
dK _ We 
ies ak log. n. : (31) 


Since ¢ in this case is proportional to the age of the individual, we 
see that the rate of learning decreases with age. j 
The maximum amount of knowledge is given, according to (30), 
by log... The number n, while a characteristic of the environment 
is nevertheless to some extent determined by the individual. The 
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larger the scope of interest of an individual, the larger the number n 
of events to which he gives his attention. If n is very large the indi- 
vidual can “afford” fairly precise knowledge of some events without 
impairing materially the amount of information available to him. 
The quantity log, » may be considered as the measure of the indi- 
vidual’s “span of interest.’”’ The larger the “span of interest” of an 
individual, the larger will be his rate of learning at a given age. 


II. The Social Group. 

Consider a sufficiently large group of N, individuals which is 
characterized by properties discussed previously (Rashevsky, 1950, 
chap. xii). That is, each individual always exhibits one of two mu- 
tually exclusive reactions, R, and R., and each has a bias ¢ in favor 
of one or the other of the two reactions. The probabilities P, and P, 
of FR, and R, respectively are functions of ¢. Assuming with H. G. 
Landau (1950) a normal distribution of the random fluctuations with 
a standard deviation o, and putting 


dP it : “ong 
(pga fe Fe 
oC V 22 01 a 


p,=4( +), p,=1-4( +) 5 (33) 


For ¢=—0;P,=P,=}3. 

We consider furthermore that the reaction R, produced by an- 
other individual acts as a stimulus favoring R, in a given individual, 
while a reaction R, of another individual favors R, in a given one. We 
shall neglect here the variations in the intensities of the reactions, 
and consider both R, and R, as always of the same intensity S. De- 
noting by P’ the probability of reaction R,, when the other indi- 
vidual exhibits that reaction too, and choosing units for S properly 


we now have 
=F 
P=¢( 7 ) (34) 


O1 


we have: 


The probability P” of R, when the other individual exhibits R,, is 


given by 
Ks) 
prag(*— J. (35) 


O71 
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As we have seen at the beginning of this paper, such an indi- 
vidual may be considered as capable of transmitting a maximum of 
one bit of information per reaction. Suppose we have a linear chain 
of n such individuals. How much information can be transmitted 
per signal through such a chain? 

To answer this question, let us at first consider the very special 
case in which the biases ¢ of all the individuals are the same. De- 
note by p, the probability of the reaction R, by the nth individual 
when it is known that the zeroth individual has exhibited the re- 
action R,. The probability p, is the sum of the probabilities that Ri 
is exhibited when the (n — 1)st individual exhibits R,, and of the 
probability that R, is exhibited when the (x — 1)st individual ex- 
hibits R.. Since the probabilities of the (n — 1)st individual ex- 
hibiting R, and R, are respectively pn. and 1 — pn+, we have 


Pn — Pn-a da (1 Satay pee (36) 
or 
Di (PS Po) Deep — Be (87) 


This is a difference equation with constant coefficients. We seek the 
value p, for the case in which p,) = 1. Putting 


PP AS eB (38) 
the difference equation (37) becomes 
Yn —ADrii=—B. (39) 
Its solution, for p, = 1, is 
1 — A* 
n—A*+B 
p ae (40) 


We may similarly calculate the probability q, of a reaction R, by the 
mth individual, when the zeroth has exhibited R,. Denote by Q’ the 
probability of the nth individual exhibiting R,, when the (7 —" 1) st 
exhibits k,, and by Q” the probability of the nth individual exhibit- 
ing R,, when the (n — 1)st exhibits R,. We have 


Q=1—P’"; Q"=1-P. (41) 


‘ In a similar manner as that by which we arrived at (87) we now 
ave 


Qn = (Q' — Q") Gra =Q". (42) 
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The initial condition is now, however, q@ = 0. 
Hence, putting 
Q—Q"=A4'; Q"=B, (43) 
we have: 
=e: 44 
n ea: (44) 


Remembering (38) and (41) we find from (40) and (44), after re- 
arrangements, that 


Pn +Qnr—1, (45) 
as should be the case. 


Expression (40) is a particular form of the more general ex- 
pression 


1— A” 
Dn —p, A" +B 1 


= (46) 
which holds when p, > 0. We thus see that the first term of (40) 
depends in general on the initial reaction, the second is a character- 
istic of the “channel” only. As n tends to infinity, the first term of 
(40) tends to zero, because A < 1, the second to 


B Pe 
D voir ee 


= ————_- <1 (47) 
plies I ees om sail 


Thus the probability of a reaction becomes, for an infinite value of 
m independent of the initial reaction or signal. No information is 
then transmitted at all. We may naturally interpret the first term of 
(40) as representing the probability of R, due to transmitted infor- 
mation, while the second term of (40) may be said to represent the 
probability of R, due to “channel noise.’ The effect of the channel 
noise increases from zero to the constant asymptotic value pou 
given by (47). 
If the bias ¢ is zero, then, as seen from expressions (34) and 
(35), P’ + P”=1 and p.~ =}, a result which is intuitively evident. 
Putting 
B(1—A") 
Din =A 5). Don a ian A ? (48) 


we may write expression (40) as follows 
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Pn — Pin rie Pen ° (49) 


The probability pc, due to channel noise may in principle be cal- 
culated or determined empirically. When po = 1, q = 0, then the 
probability of g,, given by equation (44), is due to channel noise 
only. If the bias ¢ is zero, then, as we have seen above, P’ + P” = 1. 
Together with (41) this gives P” = Q”. Hence, because of equations 
(38) and (43) we have B = B’. From (38) and (41) we also find 
A=A’. It follows that for 6 = 0, den = Den, aS Should be the case. 
Regardless of the channel length 7, the contribution of the channel 
noise to R, and R, is equally probable for ¢ = 0. Suppose we receive 
at the output end M signals, of which we shall have Mp, signals FR, . 
Of those Mp, signals R, a number Mp,, is due to the channel noise 
and Myp;, is due to the signal R, at the input. For each of the MDen 
signals the a posteriori probability of R, at the origin is}. For each 
of the Mp;, signals this a posteriori probability is 1. Hence the av- 
erage a posteriori probability of R, at the origin when R, is observed 
at the end is 


a —_ eMden <* MPin =< Pen =f 2Din 
°° Min PMDin BP 


(50) 


As n increases, pin tends to zero, while p., tends to pn. Hence the 


a posteriori probability Dn tends to 4. 
The transmitted information per signal at the end of the chan- 


nel when at the origin it is equal to 1, is given according to C. Shan- 
non (1949), by 


Re = 1 ( pp logs p=! C= py) los) ye 


As 7 increases and p, tends to 4, the expression in braces tends to 
1, and RF, tends to zero. Introducing (34) and (35) into (38), then 
the latter into (48), then introducing (48) into (50) and finally the 
whe into (51), we obtain R, as a function of the length n of the 
chain. 

The above situation is, however, complicated by the fact that 
the values P’ and P” are in general different for different members 
of the chain, because different individuals have different biases ¢. 
In general instead of equation (39) we shall have 


Pn —_ Apes aa By . (52) 
Denoting by ¢, the bias of the nth individual, we have: 
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TS as 
4=6(2—=)—¢( 2") , 


O71 


(53) 


O1 


If A, and B, are definite known functions of n , then the general so- 
lution of (52) for p, = 1, is 


n Be 

Pra Ar-Ao(1+ 375"), (54) 
which reduces to (39) when A, = A; B,=B. Unfortunately the 
A,’s and B,’s are not known functions of n, because ¢@ varies at ran- 
dom from individual to individual. The simplest thing to do is to use 
equations (39) and (40) with constant coefficients, but introduce for 
A and B appropriate average values A and B. If N(¢) denotes the 
distribution function of ¢ we may put 


A= { Aw) N() da; B= Jf BUH) N() dg, (55) 


where 


a =o( = )-4(—); 


(56) 
¢—S 
B(¢) =G ——) : 
O41 

We thus see, that the transmission of information along a “social 
chain” depends on the distribution function N(¢). 

A somewhat more rigorous, but elaborate, discussion of this 
problem will be given in a subsequent paper. 

If the population density is sufficiently high, we may introduce 
into (40) the variable 


x=an (57) 


where a is the average distance between two adjacent individuals, 
and x the actual physical length of the chain or the distance between 
its two ends. Then R, becomes a function R(x) of «, and the equa- 
tion developed above gives us the amount of information transmitted 
through a chain of individuals as a function of the physical length 
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of that chain. 
We now have instead of (51) 


R(x) =1+p(a) log p(«) + [1—p(#)] logs [1—p(*)]. (58) 


Now we may compute the amount of information per signal that 
ean be transmitted from one individual to another, when the distance 
between the individuals is equal to 7 and when they both are mem- 
bers of a social group of constant population density. In such a case 
there is a large number of possible channels between the two indi- 
viduals, through which the information may be transmitted. There 
exists a shortest channel, whose length is close to 7. But there are 
many others, much longer ones, such as channels I, II and III on 
Figure 2. Some channels, as II and III, may have general links in 


FIGURE 2 


common. All these possible channels are connected in parallel, and 
therefore the total information transmitted through them is the sum 
of the information transmitted through each channel separately. The 
evaluation of the total number of possible discrete channels is a 
rather complex problem. We may, however, use a good approxima- 
tion, by passing to a continuous case. Consider the transmission 
of information between individuals 1 and 2 (Figure 3). Let ds = 
rdr'd be an element of area at the point 3, which lies on a circle 
of radius 7’. If 6 is the density of population, then the element ds 
contains ér’dr'd@ individuals. From the individual 1 to any one of 
the above individuals there is a shortest chain of length 7’. And 
similarly from each of those individuals to the individual 2 there is 
a shortest chain of length r”. Hence, there are altogether ér'dr'dé 
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FIGURE 3 


chains of length x’ + r” between individuals 1 and 2, going through 
point 3. The amount of information transmitted through those chains 
is 


dlI=R(r' +71") 6r'dr'de, (59) 
where R(r’ + r”) is given by (58), in which we put « = 7’ +r”. 
But 
yr" = \/7? + 72 — 2rr’ cos 6. (60) 
Hence 
dlI=6 R(r + Vr? + 7? —2rr’ cos 6) r'dr'dé. (61) 


The total information transmitted from 1 to 2 is obtained by inte- 
grating expression (61) with respect to 7’ and 6 over the whole pos- 
sible range of those two variables. The variables 7’ and 6 integrate 
out and the total information becomes a function of 7 only: 


I(r) =6 f i FOG 1 217 008 0) 7 CF Gr ( 62) 


The limits of integration in (62) will in general depend on the size 
and shape of the area occupied by the population. This is not sur- 
prising, since the number and lengths of possible channels depend on 
the size and shape of the area. Since R(x) decreases to zero expo- 
nentially, as seen from equations (40), (51) and (57), therefore 
when the two individuals are not too close to the periphery, and when 
their distance is small compared to the size of the area occupied by 
the social group, I(a#) will be practically independent on the size and 
shape of the area. It is even possible that the double integral may 
have a meaning for an infinite population of uniform density. 

It is readily seen that an individual receives more information 
when he is located in the center of a two dimensional area, than when 
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he is a member of a linearly arranged chain of individuals. 

In a previous study (Rashevsky, 1950) effects of distance be- 
tween individuals on their tendency to imitate each other were stud- 
ied. It is natural to assume that the imitation effect will be propor- 
tional to the amount of information received. In that case the ex- 
pression J(v) may well be used for the kernel K(r) introduced in 
the above mentioned study. 

The next step in the present study would be to generalize our 
results to the case of variable intensity of reactions R, and Rz, as 
well as to introduce thresholds, somewhat in the manner as has been 
done in connection with equations (5). The most important gen- 
eralization, however, would be toward more than two mutually ex- 
clusive, or non-exclusive reactions. 

This investigation is part of the work done under Contract No. 
AF 19(122)-161 between the U. S. Air Force Cambridge Research 
Laboratories and The University of Chicago. 
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